Abstract — Given a multiple-input multiple-output (MIMO) 
channel, feedback from the receiver can be used to specify 
a transmit precoding matrix, which selectively activates the 
strongest channel modes. Here we analyze the performance of 
Random Vector Quantization (RVQ), in which the precoding ma- 
trix is selected from a random codebook containing independent, 
isotropically distributed entries. We assume that channel elements 
are i.i.d. and known to the receiver, which relays the optimal 
(rate-maximizing) precoder codebook index to the transmitter 
using B bits. We first derive the large system capacity of 
beamforming (rank-one precoding matrix) as a function of B, 
where large system refers to the limit as B and the number 
of transmit and receive antennas all go to infinity with fixed 
ratios. RVQ for beamforming is asymptotically optimal, i.e., no 
other quantization scheme can achieve a larger asymptotic rate. 
We subsequently consider a precoding matrix with arbitrary 
rank, and approximate the asymptotic RVQ performance with 
optimal and linear receivers (matched filter and Minimum Mean 
Squared Error (MMSE)). Numerical examples show that these 
approximations accurately predict the performance of finite-size 
systems of interest. Given a target spectral efficiency, numerical 
examples show that the amount of feedback required by the 
linear MMSE receiver is only slightly more than that required 
by the optimal receiver, whereas the matched filter can require 
significantly more feedback. 

Index Terms — Beamforming, large system analysis, limited 
feedback, Multi-Input Multi-Output (MIMO), precoding, vector 
quantization. 



I. Introduction 

GIVEN a multi-input multi-output (MIMO) channel, pro- 
viding channel information at the transmitter can in- 
crease the achievable rate and simplify the coder and decoder. 
Namely, this channel information can specify a precoding 
matrix, which aligns the transmitted signal along the strongest 
channel modes (i.e., singular vectors corresponding to the 
largest singular values). In practice, the precoding matrix must 
be quantized at the receiver, and relayed to the transmitter 
via a feedback channel. The corresponding achievable rate is 
therefore limited by the accuracy of the quantizer. 

The design and performance of quantized precoding matri- 
ces for multi-input single-output (MISO) and MIMO channels 
has been considered in numerous references, including [1]- 
[12]. In those references, and in this paper, the channel is 
assumed to be stationary, known at the receiver, and the perfor- 
mance is evaluated as a function of the number of quantization 
bits B. (This is in contrast with other work, which models 
estimation error at the receiver, but does not explicitly account 
for quantization error (e.g., [13], [14]), and which assumes a 
time-varying channel with feedback of second-order statistics 
[15]-[18].) Optimization of vector quantization codebooks is 
discussed in [1]-[3], [5], [6] for beamforming, and in [4], 
[7], [12] for MIMO channels with precoding matrices that 
provide multiplexing gain (i.e., have rank larger than one). It 
is shown in [2], [3] that this optimization can be interpreted 
as maximizing the minimum distance between points in a 
Grassmannian space. (See also [9].) The performance of this 
class of Grassmannian codebooks is also studied in [8]-[10]. 

In this paper, we evaluate the performance of a Random 
Vector Quantization (RVQ) scheme for the precoding matrix. 
Namely, given B feedback bits, the precoding matrix is 
selected from a random codebook containing $2^B$ matrices, 
which are independent and isotropically distributed. RVQ has 
been analyzed in other source coding contexts (e.g., see [19] 
and the related discussion in [20]), and achieves the rate- 
distortion bound for ergodic Gaussian sources. This work is 
motivated by prior work [21] in which RVQ is considered 
for signature quantization in Code-Division Multiple Access 
(CDMA). In that scenario, limited feedback is used to select a 
signature for a particular user, which maximizes the received 
Signal-to-Interference-Plus-Noise Ratio (SINR). RVQ has the 
attractive properties of being tractable and asymptotically 
optimal. Namely, in [21] the received SINR with RVQ is 
evaluated in the asymptotic (large system) limit as processing 
gain, number of users, and feedback bits all tend to infinity 
with fixed ratios. Furthermore, it is shown that no other 
quantization scheme can achieve a larger asymptotic SINR. 

Here we assume an i.i.d. block Rayleigh fading channel 
model with independent channel gains, and take ergodic 
capacity as the performance criterion. The receiver relays B 
bits to the transmitter (per codeword) via a reliable feedback 
channel (i.e., no feedback errors) with no delay. We start by 
evaluating the capacity of MISO and MIMO channels with a 
quantized beamformer, i.e., rank-one precoding matrix. Our 
results are asymptotic as the number of transmit antennas $N_t$ 
and feedback bits $B$ both tend to infinity with fixed $B/N_t$ 
(feedback bits per degree of freedom). For the MIMO channel 
the number of receive antennas $N_r$ also tends to infinity 
in proportion with $N_t$ and $B$. The asymptotic expressions 
accurately predict the performance of finite-size systems of 
interest as a function of normalized feedback and background 
Signal-to-Noise Ratio (SNR). In analogy with the optimality 
result shown in [21], RVQ is also asymptotically optimal in 
this scenario, i.e., no other quantization scheme can achieve 
a larger asymptotic rate. Furthermore, numerical examples for 
small Nt show that RVQ performance averaged over code- 
books is essentially the same as that obtained from codebooks 
optimized via the Lloyd-Max algorithm [1], [5], [6]. (See 
also the numerical examples in [22], which compare RVQ 
performance with the optimized (Grassmannian) codebooks 
in [3].) 

We then consider quantization of a precoding matrix with 
arbitrary rank. Namely, a rank K precoding matrix multiplexes 
K independent streams of transmitted information symbols 
onto the Nt transmit antennas. In that case, the capacity with 
limited feedback is approximated in the limit as $B$, $N_t$, $N_r$, 
and $K$ all tend to infinity with fixed ratios $K/N_t$, $N_r/N_t$, 
and $B/N_t^2$. That is, the number of feedback bits again scales 
linearly with the number of degrees of freedom, which is 
proportional to $N_t^2$. Although our results for beamforming suggest 
that RVQ is also asymptotically optimal in this scenario, this 
remains an open question. 

The asymptotic results for a precoder matrix with arbitrary 
rank K can be used to determine the normalized rank, or 
multiplexing gain K/Nt, which maximizes the capacity. This 
optimized rank in general depends on the normalized feedback, 
the ratio of antennas Nt/N r , and the SNR. For example, if 
Nt/N r > 1 and the SNR is sufficiently large, then as the 
feedback increases from zero to infinity, the optimized rank 
decreases from one to N r /N t . Numerical results are presented, 
which illustrate the effect of normalized rank on achievable 
rate, and also show that the asymptotic results accurately 
predict simulated results for finite-size systems of interest. 

We also evaluate the performance of RVQ with linear 
receivers (i.e., the matched filter and linear Minimum Mean 
Squared Error (MMSE) receivers), and compare their perfor- 
mance with the optimal (capacity-achieving) receiver. With the 
optimal precoding matrix, corresponding to infinite feedback, 
both linear receivers are optimal. With limited feedback the 
two linear receivers are simpler than the optimal receiver, 
but require more feedback to achieve a target rate. Numerical 
results show that this additional feedback required by the linear 
MMSE receiver is quite small, whereas the additional feedback 
required by the matched filter can be significant (e.g., about 
one bit per precoding matrix element). 

In addition to quantizing the optimal precoding matrix, 
power for each data stream can also be optimized, quantized, 
and fed back to the transmitter (e.g., see [23], [24]). Asymptot- 
ically, the amount of feedback required to specify the power is 
negligible compared to the feedback required for the precoding 
matrix. Furthermore, uniform power over the set of activated 
channel modes typically performs close to the optimal (water-filling) 
performance [8]. We therefore only consider quantization of 
the precoding matrix. 

Other related work on RVQ for MIMO channels has been 
presented in [25]-[27]. Namely, exact expressions for the 
ergodic capacity with beamforming and RVQ for a finite-size 
MISO channel are derived in [25]. The performance of RVQ 
for precoding over a broadcast MIMO channel is analyzed 
in [26], [27]. A closely related random beamforming scheme 
for the multiuser MIMO broadcast channel was previously 
presented in [28]. In that work, the growth in sum capacity is 
characterized asymptotically as the number of users becomes 
large with a fixed number of antennas. (Random beamforming 
was previously proposed in [29], although there the main focus 
is to improve fairness among users.) 

The paper is organized as follows. Section II describes 
the channel model, Section III considers the capacity of 
beamforming with limited feedback, and Sections IV and V 
examine the capacity of a quantized precoding matrix with 
optimal and linear receivers, respectively. Derivations of the 
main results are given in the appendices. 

II. Channel Model 

We consider a point-to-point, flat Rayleigh fading channel 
with $N_t$ transmit antennas and $N_r$ receive antennas. Let $x = [x_k]$ 
be a $K \times 1$ vector of transmitted symbols with covariance 
matrix $I_K$, where $I_K$ is the $K \times K$ identity matrix, and $K$ is 
the number of independent data streams. The received $N_r \times 1$ 
vector is given by 

$$ y = \frac{1}{\sqrt{K}} H V x + n \tag{1} $$

where $H = [h_{n_r,n_t}]$ is an $N_r \times N_t$ channel matrix, 
$V = [v_1\ v_2\ \cdots\ v_K]$ is an $N_t \times K$ precoding matrix, and $n$ is a 
complex Gaussian noise $N_r \times 1$ vector with covariance matrix 
$\sigma_n^2 I_{N_r}$. Assuming rich scattering and Rayleigh fading, the 
elements of $H$ are independent, and the channel coefficient 
between the $n_t$th transmit antenna and the $n_r$th receive antenna, 
$h_{n_r,n_t}$, is a circularly symmetric complex Gaussian random 
variable with zero mean and unit variance ($E[|h_{n_r,n_t}|^2] = 1$). 

We assume i.i.d. block fading, i.e., the channel is static 
within a fading block, and the channels across blocks are 
independent. The ergodic capacity is achieved by coding the 
transmitted symbols across an infinitely large number of fading 
blocks. With perfect channel knowledge at the receiver and 
a given precoding matrix V, the ergodic capacity is the 
mutual information between x and y with a complex Gaussian 
distributed input, averaged over the channel, given by 

$$ I(x; y) = E_H\left[ \log\det\left( I_{N_r} + \frac{\rho}{K} H V V^\dagger H^\dagger \right) \right] \tag{2} $$

where $\rho = 1/\sigma_n^2$ is the background SNR. We wish to 
specify the precoding matrix $V$ that maximizes the mutual 
information, subject to a power constraint $\|v_k\| \le 1$, for 
$1 \le k \le K$. 

With unlimited feedback, the columns of the optimal 
precoding matrix, which maximizes (2), are eigenvectors of the 
channel covariance matrix $H^\dagger H$. With $B$ feedback bits per 
fading block, we can specify the precoding matrix from a 
quantization set or codebook $\mathcal{V} = \{V_1, \ldots, V_{2^B}\}$ known 
a priori to both the transmitter and receiver. The receiver 
chooses the $V_j$ that maximizes the sum mutual information, 
and relays the corresponding index back to the transmitter. 
Of course, the performance (ergodic capacity) depends on the 
codebook $\mathcal{V}$. 

III. Beamforming with Limited Feedback 

We start with a rank-one precoding matrix, corresponding 
to a single data stream (K = 1). In that case, the precoding 
matrix is specified by an $N_t \times 1$ beamforming vector $v$, which 
ideally corresponds to the strongest channel mode. That is, the 
optimal $v$, which maximizes the ergodic capacity in (2), is the 
eigenvector of $H^\dagger H$ corresponding to the largest eigenvalue. 
This vector is computed at the receiver and a quantized version 
is relayed back to the transmitter. 

Let $\mathcal{V} = \{v_1, \ldots, v_{2^B}\}$ denote the quantization codebook 
for $v$, given $B$ feedback bits. Optimization of this codebook 
has been considered in [2], [3], [6] with outage capacity and 
ergodic capacity as performance metrics. The performance of 
an optimized codebook is difficult to evaluate exactly, and 
is approximated in [2], [3], [6], [9]-[11]. Here we consider 
RVQ in which $v_1, \ldots, v_{2^B}$ are independent, isotropically 
distributed random vectors, each with unit norm. This is 
motivated by the observation that given a channel matrix $H$ 
with i.i.d. elements, the eigenvectors of $H^\dagger H$ are 
isotropically distributed [30], hence the codebook entries should be 
uniformly distributed over the space of beamforming vectors. 

A. MISO Channel 

We first consider a MISO channel, corresponding to a single 
receive antenna ($N_r = 1$). In that case, $H$ is an $N_t \times 1$ channel 
vector, which we denote as $h$. The optimal beamformer, which 
maximizes the mutual information in (2), is the normalized 
channel vector $h/\|h\|$, and the corresponding mutual 
information is $E_h[\log(1 + \rho h^\dagger h)]$. The receiver selects the quantized 
precoding vector to maximize the mutual information, i.e., 

$$ \hat{v} = \arg\max_{1 \le j \le 2^B} \left\{ I_j = \log(1 + \rho |h^\dagger v_j|^2) \right\} \tag{3} $$

and the corresponding achievable rate is 

$$ I_{\mathrm{rvq}}^{N_t} = \max_{1 \le j \le 2^B} I_j \tag{4} $$



where the superscript $N_t$ denotes the system size. The 
achievable rate depends on the codebook $\mathcal{V}$ and the channel vector 
$h$, and is therefore random. Rather than averaging $I_{\mathrm{rvq}}^{N_t}$ over 
$\mathcal{V}$ and $h$ to find the ergodic capacity, we instead evaluate the 
limiting performance as $N_t$ and $B$ tend to infinity with fixed 
$\bar{B} = B/N_t$ (feedback bits per transmit antenna). In this limit, 
$I_{\mathrm{rvq}}^{N_t}$, measured relative to $\log(\rho N_t)$, converges to a deterministic constant. This is illustrated 
in Fig. 1, which shows the pdf of $|h^\dagger \hat{v}|^2/\|h\|^2$ for different 
$N_t$ with no feedback ($\bar{B} = 0$), and for RVQ with $\bar{B} = 2$. The 
figure shows that convergence of the pdf to a point mass is 
faster with feedback than without. 
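
The selection rule in (3) and (4) can be illustrated with a small simulation. The following is a minimal sketch, not the authors' simulation code: it assumes natural logarithms, draws isotropic unit vectors by normalizing complex Gaussian vectors, and uses hypothetical helper names (`isotropic_unit_vector`, `rayleigh_channel`, `rvq_select`) and illustrative parameter values ($N_t = 6$, $B = 6$, $\rho = 5$ dB).

```python
import math
import random


def isotropic_unit_vector(n, rng):
    """Draw a vector uniformly from the complex unit sphere in C^n
    by normalizing an i.i.d. complex Gaussian vector."""
    g = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n)]
    norm = math.sqrt(sum(abs(x) ** 2 for x in g))
    return [x / norm for x in g]


def rayleigh_channel(n, rng):
    """i.i.d. circularly symmetric complex Gaussian entries, unit variance."""
    s = math.sqrt(0.5)  # real and imaginary parts each have variance 1/2
    return [complex(rng.gauss(0, s), rng.gauss(0, s)) for _ in range(n)]


def rvq_select(h, codebook, rho):
    """Return (index, rate) of the entry maximizing log(1 + rho |h^H v_j|^2)."""
    def rate(v):
        inner = sum(hi.conjugate() * vi for hi, vi in zip(h, v))
        return math.log(1.0 + rho * abs(inner) ** 2)

    rates = [rate(v) for v in codebook]
    best = max(range(len(codebook)), key=rates.__getitem__)
    return best, rates[best]


rng = random.Random(0)              # fixed seed for reproducibility
n_t, bits, rho = 6, 6, 10 ** 0.5    # rho = 5 dB (illustrative values)
h = rayleigh_channel(n_t, rng)
codebook = [isotropic_unit_vector(n_t, rng) for _ in range(2 ** bits)]
j_hat, rate_rvq = rvq_select(h, codebook, rho)

# Rate with the ideal beamformer h/||h|| (unlimited feedback).
rate_perfect = math.log(1.0 + rho * sum(abs(x) ** 2 for x in h))
print(j_hat, rate_rvq, rate_perfect)
```

By the Cauchy-Schwarz inequality the quantized rate can never exceed the rate of the ideal beamformer, which gives a quick sanity check on any such simulation.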

Fig. 1. pdf of $|h^\dagger \hat{v}|^2/\|h\|^2$ with RVQ for different values of $N_t$. 

As $N_t \to \infty$, $(h^\dagger h)/N_t \to 1$ almost surely, so that 
$\log(1 + \rho h^\dagger h) - \log(\rho N_t) \to 0$. That is, with perfect channel 
knowledge at the transmitter, the ergodic capacity increases as 
$\log(\rho N_t)$. With finite feedback there is a rate loss, which is 
defined as 

$$ I^{\Delta}_{\mathrm{rvq}} = I_{\mathrm{rvq}}^{N_t} - \log(\rho N_t). \tag{5} $$

For finite $N_t$, $I^{\Delta}_{\mathrm{rvq}}$ is random; however, in the large system 
limit $I^{\Delta}_{\mathrm{rvq}}$ converges to a deterministic constant. 

Theorem 1. As $(N_t, B) \to \infty$ with fixed $\bar{B} = B/N_t$, the rate 
difference $I^{\Delta}_{\mathrm{rvq}}$ converges in the mean square sense to 

$$ \mathcal{I}^{\Delta}_{\mathrm{rvq}} = \log(1 - 2^{-\bar{B}}). \tag{6} $$



The proof is given in Appendix A. For $\bar{B} > 0$, the rate loss 
due to finite feedback is a constant. As $\bar{B} \to 0$, this rate loss 
tends to infinity, since with $\bar{B} = 0$, the capacity tends to a 
constant as $N_t \to \infty$, whereas the capacity grows as $\log N_t$ 
for $\bar{B} > 0$. Of course, as $\bar{B} \to \infty$ (unlimited feedback), the 
rate loss vanishes. 
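
To get a feel for the size of the asymptotic rate loss in Theorem 1, the expression $\log(1 - 2^{-\bar{B}})$ can be tabulated directly. The sketch below assumes natural logarithms (nats); the function name `asymptotic_rate_loss` is hypothetical and chosen only for illustration.

```python
import math


def asymptotic_rate_loss(b_bar):
    """Large-system MISO rate loss log(1 - 2^{-b_bar}) in nats,
    where b_bar = B / N_t is the feedback per transmit antenna."""
    if b_bar <= 0:
        raise ValueError("the rate loss is finite only for b_bar > 0")
    # log1p keeps precision when 2^{-b_bar} is very small (large b_bar)
    return math.log1p(-(2.0 ** (-b_bar)))


for b_bar in (0.5, 1.0, 2.0, 4.0):
    loss = asymptotic_rate_loss(b_bar)
    print(f"b_bar = {b_bar}: {loss:.4f} nats = {loss / math.log(2):.4f} bits")
```

With one feedback bit per transmit antenna the loss is $\log(1/2)$, i.e. exactly one bit per channel use, and with two bits per antenna it is $\log(3/4)$.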

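RVQ is asymptotically optimal in the following sense. 
Suppose that $\{\mathcal{V}_{N_t}\}$ is an arbitrary sequence of codebooks 
for the beamforming vector where 

$$ \mathcal{V}_{N_t} = \{ v_1^{N_t}, \ldots, v_{2^B}^{N_t} \} \tag{7} $$

is the codebook for a particular $N_t$ and $\|v_j^{N_t}\|^2 = 1$ for each 
$j$. The associated rate is given by 

$$ I_{\mathcal{V}_{N_t}} = \max_{1 \le j \le 2^B} \log\left(1 + \rho |h^\dagger v_j^{N_t}|^2\right) \tag{8} $$

and the rate difference $I^{\Delta}_{\mathcal{V}_{N_t}} = I_{\mathcal{V}_{N_t}} - \log(\rho N_t)$. 

Theorem 2. For any sequence of codebooks $\{\mathcal{V}_{N_t}\}$, 

$$ \limsup_{(N_t, B) \to \infty} E_h\left[ I^{\Delta}_{\mathcal{V}_{N_t}} \right] \le \mathcal{I}^{\Delta}_{\mathrm{rvq}}. \tag{9} $$

The proof is given in Appendix B. 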

Although the optimality of RVQ holds only in the large 
system limit, numerical results in Section III-C show that for 
finite-size systems of interest RVQ performs essentially the 
same as optimized quantization codebooks. 

B. Multi-Input Multi-Output (MIMO) Channel 

We now consider quantized beamforming for a MIMO 
channel, i.e., with multiple transmit and receive antennas. 
Taking the rank $K = 1$ maximizes the diversity gain [31], 
but the corresponding capacity grows only as $\log N_t$ instead 
of linearly with $N_t$, which is the case when $K$ grows 
proportionally with $N_t$. (This is true with both unlimited and 
limited feedback, assuming a fixed number of feedback bits 
per precoder element.) Also, a beamformer is significantly less 
complex than a matrix precoder with K > 1, and requires less 
feedback to specify. 

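We again consider an RVQ codebook $\mathcal{V}$ with $2^B$ 
independent unit-norm vectors, where each vector is uniformly 
distributed over the $N_t$-dimensional unit sphere. The achievable 
rate is $E_H[I_{\mathrm{rvq}}^{N_t}]$, where 

$$ I_{\mathrm{rvq}}^{N_t} = E_{\mathcal{V}}\left[ \max_{1 \le j \le 2^B} \log\left(1 + \rho \|H v_j\|^2\right) \,\Big|\, H \right] \tag{10} $$

$$ \phantom{I_{\mathrm{rvq}}^{N_t}} = E_{\mathcal{V}}\left[ \log\left(1 + \rho \max_{1 \le j \le 2^B} \|H v_j\|^2\right) \,\Big|\, H \right]. \tag{11} $$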



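As for the MISO channel, with unlimited feedback the 
achievable rate increases as $\log(\rho N_t)$. We again define the rate 
difference due to quantization as 

$$ I^{\Delta}_{\mathrm{rvq}} = I_{\mathrm{rvq}}^{N_t} - \log(\rho N_t) = E_{\mathcal{V}}\left[ \log\left( \frac{1}{\rho N_t} + \max_{1 \le j \le 2^B} \gamma_j \right) \,\Big|\, H \right] \tag{12} $$

where 

$$ \gamma_j = \frac{1}{N_t} \|H v_j\|^2. \tag{13} $$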



Evaluating the expectation in (12) is difficult for finite $N_t$, 
$N_r$, and $B$, so that we again resort to a large system analysis. 
Namely, we let $N_t$, $N_r$, and $B$ each tend to infinity with fixed 
$\bar{B} = B/N_t$ and $\bar{N}_r = N_r/N_t$. For each $N_t$ and $N_r$ the channel 
matrix $H$ is chosen as the $N_r \times N_t$ upper-left corner of a 
matrix with an infinite number of rows and columns, and 
with i.i.d. complex Gaussian entries. 

The received power in this large system limit is given by 

$$ \gamma^{\infty}_{\mathrm{rvq}} = \lim_{(N_t, N_r, B) \to \infty} \max_{1 \le j \le 2^B} \gamma_j \,\Big|\, H \tag{14} $$

where convergence to the deterministic limit can be shown in 
the mean square sense. Conditioned on $H$, the $\gamma_j$ are i.i.d. 
since the beamforming vectors $v_j$ are i.i.d., and applying [32, 
Theorem 2.1.2], it can be shown that 

$$ \gamma^{\infty}_{\mathrm{rvq}} = \lim_{(N_t, N_r, B) \to \infty} F^{-1}_{\gamma_j | H}\left( 1 - \frac{1}{2^B} \right) \tag{15} $$

where $F_{\gamma_j | H}(\cdot)$ is the cdf of $\gamma_j$ given $H$. Analogous results 
for the interference power in CDMA with quantized signatures 
have been presented in [21], so that we omit the proofs of 
(14) and (15). Note that $\bar{N}_r \le \gamma^{\infty}_{\mathrm{rvq}} \le (1 + \sqrt{\bar{N}_r})^2$, where the 
lower and upper bounds correspond to $\bar{B} = 0$ and $\bar{B} = \infty$, 
respectively. That is, $(1 + \sqrt{\bar{N}_r})^2$ is the asymptotic maximum 
eigenvalue of the channel covariance matrix $\frac{1}{N_t} H^\dagger H$ [33]. 
The asymptotic rate difference is given by 

$$ \mathcal{I}^{\Delta}_{\mathrm{rvq}} = \lim_{(N_t, N_r, B) \to \infty} I^{\Delta}_{\mathrm{rvq}} = \log(\gamma^{\infty}_{\mathrm{rvq}}). \tag{16} $$

The limit in (14) can be explicitly evaluated, and is 
independent of the channel realization $H$. 



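Theorem 3. For $0 \le \bar{B} \le \bar{B}^*$, $\gamma^{\infty}_{\mathrm{rvq}}$ satisfies 

$$ \left(\gamma^{\infty}_{\mathrm{rvq}}\right)^{\bar{N}_r} e^{-\gamma^{\infty}_{\mathrm{rvq}}} = 2^{-\bar{B}} \left( \frac{\bar{N}_r}{e} \right)^{\bar{N}_r} \tag{17} $$

and for $\bar{B} > \bar{B}^*$, 

$$ \gamma^{\infty}_{\mathrm{rvq}} = (1 + \sqrt{\bar{N}_r})^2 - \exp\left\{ \tfrac{1}{2} \bar{N}_r \log(\bar{N}_r) - (\bar{N}_r - 1) \log(1 + \sqrt{\bar{N}_r}) + \sqrt{\bar{N}_r} - \bar{B} \log(2) \right\} \tag{18} $$

where 

$$ \bar{B}^* = \frac{1}{\log(2)} \left( \bar{N}_r \log\left( \frac{\sqrt{\bar{N}_r}}{1 + \sqrt{\bar{N}_r}} \right) + \sqrt{\bar{N}_r} \right). \tag{19} $$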



The proof is given in Appendix C and is motivated by an 
analogous result for CDMA, presented in [34]. As stated in 
Theorem 3, $\gamma^{\infty}_{\mathrm{rvq}}$ depends only on $\bar{B}$ and $\bar{N}_r$. Letting $\bar{N}_r \to 0$ 
gives the asymptotic capacity of the MISO channel with 
RVQ. As for the MISO channel, RVQ is asymptotically 
optimal. 
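
Theorem 3 defines the asymptotic received power implicitly in the low-feedback regime and in closed form above the threshold. The following is a minimal numerical sketch under stated assumptions: it reads the implicit equation as $(\gamma)^{\bar{N}_r} e^{-\gamma} = 2^{-\bar{B}} (\bar{N}_r/e)^{\bar{N}_r}$, the threshold as $\bar{B}^* = [\bar{N}_r \log(\sqrt{\bar{N}_r}/(1+\sqrt{\bar{N}_r})) + \sqrt{\bar{N}_r}]/\log 2$, and uses natural logarithms. The function names `b_star` and `gamma_rvq` are hypothetical, and bisection is simply one convenient way to solve the implicit equation.

```python
import math


def b_star(nr):
    """Threshold on normalized feedback separating the two regimes (nr > 0)."""
    s = math.sqrt(nr)
    return (nr * math.log(s / (1.0 + s)) + s) / math.log(2)


def gamma_rvq(b_bar, nr, tol=1e-12):
    """Asymptotic received power for normalized feedback b_bar = B/N_t
    and antenna ratio nr = N_r/N_t > 0."""
    s = math.sqrt(nr)
    if b_bar > b_star(nr):
        # High-feedback regime: closed-form expression.
        expo = (0.5 * nr * math.log(nr) - (nr - 1.0) * math.log(1.0 + s)
                + s - b_bar * math.log(2))
        return (1.0 + s) ** 2 - math.exp(expo)
    # Low-feedback regime: solve nr*log(g) - g = target by bisection.
    # The left side is decreasing in g on [nr, nr + sqrt(nr)].
    target = -b_bar * math.log(2) + nr * (math.log(nr) - 1.0)
    lo, hi = nr, nr + s
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if nr * math.log(mid) - mid > target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


nr = 1.5
for b_bar in (0.0, 0.25, b_star(nr), 1.0, 2.0):
    g = gamma_rvq(b_bar, nr)
    print(f"b_bar = {b_bar:.3f}: gamma = {g:.4f}, rate difference = {math.log(g):.4f}")
```

Useful checks on such an implementation are that it returns $\bar{N}_r$ at $\bar{B} = 0$, that the two branches agree at $\bar{B}^*$ (both give $\bar{N}_r + \sqrt{\bar{N}_r}$), that it approaches $(1 + \sqrt{\bar{N}_r})^2$ for large $\bar{B}$, and that it approaches the MISO value $1 - 2^{-\bar{B}}$ as $\bar{N}_r \to 0$.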

Theorem 4. As $(N_t, N_r, B) \to \infty$ with fixed $\bar{N}_r = N_r/N_t$ 
and $\bar{B} = B/N_t$, 

$$ \limsup_{(N_t, N_r, B) \to \infty} I_{\mathcal{V}_{N_t}} - \log(\rho N_t) \le \mathcal{I}^{\Delta}_{\mathrm{rvq}} \tag{20} $$

for any sequence of codebooks $\{\mathcal{V}_{N_t}\}$. 

The proof is similar to the proof of Theorem 2 in [21] and 
is therefore omitted. 



C. Numerical Results 

Figs. 2 and 3 show $\mathcal{I}^{\Delta}_{\mathrm{rvq}}$ for MISO and MIMO channels, 
respectively, with beamforming and RVQ versus normalized 
feedback bits ($\bar{B}$) with $\rho = 5$ and 10 dB. Also shown for 
comparison are achievable rates with a quantization 
codebook optimized via the Lloyd-Max algorithm [1], [5], [6], 
and the capacity with perfect beamforming, corresponding to 
unlimited feedback. The results for RVQ are averaged over 
codebook realizations, and are essentially the same as those 
shown for the optimized Lloyd-Max codebooks. For the MISO 
channel, the asymptotic capacity (6) accurately predicts the 
simulated results shown even with a relatively small number 
of transmit antennas ($N_t = 3$ and 6). For the MIMO results 
$\bar{N}_r = 1.5$, and simulation results are shown for $4 \times 6$ and 
$16 \times 24$ channels. The asymptotic results accurately predict 
the performance for the larger channel, and are somewhat less 
accurate for the smaller channel. 

Comparing finite feedback with perfect beamforming, the 
results show that one feedback bit per complex entry ($\bar{B} = 1$) 
provides more than 50% of the potential gain due to feedback. 
For both the MISO and MIMO examples shown, the perfect 
beamforming capacity is nearly achieved with two feedback 
bits per complex coefficient. 

Fig. 2. Asymptotic and simulated rate differences versus feedback bits for 
a MISO channel with beamforming. 

Fig. 3. Asymptotic and simulated rate differences versus feedback bits for 
a MIMO channel with beamforming ($\bar{N}_r = 1.5$). 



IV. Precoding Matrix with Arbitrary Rank 



In this section we consider the performance of a single-user 
MIMO channel with precoding matrix $V$ having rank $K > 1$. 
We wish to determine the asymptotic capacity with RVQ as 
in the previous section. Here we consider the large system 
limit as $(N_t, N_r, B, K) \to \infty$ with fixed ratios $\bar{N}_r = N_r/N_t$, 
$\bar{B} = B/N_t^2$, and $\bar{K} = K/N_t$. That is, we scale the rank 
of the precoding matrix with $N_t$. (The number of feedback 
bits is normalized by $N_t^2$, instead of $N_t$, since the feedback 
must scale linearly with degrees of freedom (in this case the 
number of channel elements $N_r N_t$).) Given a fixed number 
of feedback bits per channel coefficient, the capacity grows 
linearly with the number of antennas ($N_t$ or $N_r$). 

Given a rank $K \le N_t$, the precoding matrix is chosen from 
the RVQ set 

$$ \mathcal{V} = \{ V_j,\ 1 \le j \le 2^B \} \tag{21} $$

where the entries are independent $N_t \times K$ random unitary 
matrices, i.e., $V_j^\dagger V_j = I_K$. This codebook is an extension of 
the RVQ codebook for beamforming. Letting 

$$ J_j^{N_r} = \frac{1}{N_r} \log\det\left( I_{N_r} + \frac{\rho}{K} H V_j V_j^\dagger H^\dagger \right) \tag{22} $$

the receiver again selects the quantized precoding matrix, 
which maximizes the mutual information 

$$ \hat{V} = \arg\max_{1 \le j \le 2^B} J_j^{N_r}. \tag{23} $$

For finite $N_r$, we define 

$$ I_{\mathrm{rvq}}^{N_r} = E_{\mathcal{V}}\left[ \max_{1 \le j \le 2^B} J_j^{N_r} \,\Big|\, H \right] \tag{24} $$

$$ \phantom{I_{\mathrm{rvq}}^{N_r}} = E_{\mathcal{V}}\left[ \frac{1}{N_r} \log\det\left( I_{N_r} + \frac{\rho}{K} H \hat{V} \hat{V}^\dagger H^\dagger \right) \,\Big|\, H \right] \tag{25} $$

and the average sum mutual information per receive antenna 
with $B$ feedback bits is then $E_H[I_{\mathrm{rvq}}^{N_r}]$. 

Here the power allocation over channel modes is "on-off". 
Namely, active modes are assigned equal powers. This 
simplifies the analysis, and it has been observed that the 
additional gain due to an optimal power allocation (water 
pouring) is quite small [8]. 

Since the entries of the RVQ codebook are i.i.d., the mutual 
informations $J_j^{N_r}$, $j = 1, \ldots, 2^B$, are also i.i.d. for a given $H$. 
In principle, the large system limit of $I_{\mathrm{rvq}}^{N_r}$ can be evaluated, 
in analogy with (15), given the cdf of $J_j^{N_r}$ given $H$, denoted 
as $F_{J^{N_r} | H}$. This cdf appears to be difficult to determine in 
closed form for general $(N_r, N_t, K)$, so that we are unable to 
derive the exact asymptotic capacity with RVQ. Still, we can 
provide an accurate approximation for this large system limit. 
Before presenting this approximation, we first compare the 
capacity with no channel information at the transmitter ($\bar{B} = 0$) 
to the capacity with perfect channel information ($\bar{B} = \infty$). 

If $\bar{B} = 0$, then the optimal transmit covariance matrix 
$V V^\dagger = I_{N_t}$ and $K = N_t$ [35]. That is, all channel modes 
are allocated equal power. As $(N_t, N_r) \to \infty$ with fixed 
$\bar{N}_r = N_r/N_t$, the capacity per receive antenna is given by 

$$ \frac{1}{N_r} \log\det\left( I_{N_r} + \frac{\rho}{N_t} H H^\dagger \right) \to \int_a^b \log(1 + \rho\lambda)\, g(\lambda)\, d\lambda \tag{26} $$

$$ = \mathcal{I}_{\mathrm{rvq}}(\bar{B} = 0) \tag{27} $$

where convergence is in the almost sure sense, and $g(\lambda)$ is the 
asymptotic probability density function for a randomly chosen 
eigenvalue of $\frac{1}{N_t} H H^\dagger$, and is given by [33] 

$$ g(\lambda) = \frac{\sqrt{(\lambda - a)(b - \lambda)}}{2\pi\lambda \bar{N}_r} \quad \text{for } a \le \lambda \le b, \tag{28} $$

$$ a = \left(1 - \sqrt{\bar{N}_r}\right)^2 \quad \text{and} \quad b = \left(1 + \sqrt{\bar{N}_r}\right)^2 \tag{29} $$

for $\bar{N}_r \le 1$. The integral in (26) has been evaluated in [36], 
which gives the closed-form expression 

$$ \mathcal{I}_{\mathrm{rvq}}(\bar{B} = 0) = \log(\rho y) + \frac{1 - \bar{N}_r}{\bar{N}_r} \log\left( \frac{1}{1 - z} \right) - \frac{z}{\bar{N}_r} \tag{30} $$

where 

$$ y = \frac{1}{2} \left( 1 + \bar{N}_r + \frac{1}{\rho} + \sqrt{ \left( 1 + \bar{N}_r + \frac{1}{\rho} \right)^2 - 4\bar{N}_r } \right) \tag{31} $$

$$ z = \frac{1}{2} \left( 1 + \bar{N}_r + \frac{1}{\rho} - \sqrt{ \left( 1 + \bar{N}_r + \frac{1}{\rho} \right)^2 - 4\bar{N}_r } \right). \tag{32} $$
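
The closed form for the no-feedback rate can be checked against direct numerical integration of the eigenvalue density. The sketch below is an illustration under stated assumptions rather than a definitive implementation: it takes the density to be $g(\lambda) = \sqrt{(\lambda - a)(b - \lambda)}/(2\pi\lambda\bar{N}_r)$ on $[a, b]$ with $a, b = (1 \mp \sqrt{\bar{N}_r})^2$, reads the closed form as $\log(\rho y) + \frac{1 - \bar{N}_r}{\bar{N}_r}\log\frac{1}{1 - z} - \frac{z}{\bar{N}_r}$, uses natural logarithms, and requires $\bar{N}_r < 1$. All function names are hypothetical.

```python
import math


def eigenvalue_density(lam, nr):
    """Asymptotic eigenvalue density g(lambda) for antenna ratio nr < 1."""
    a = (1.0 - math.sqrt(nr)) ** 2
    b = (1.0 + math.sqrt(nr)) ** 2
    if lam <= a or lam >= b:
        return 0.0
    return math.sqrt((lam - a) * (b - lam)) / (2.0 * math.pi * lam * nr)


def rate_no_feedback_numeric(rho, nr, steps=20000):
    """Midpoint-rule integral of log(1 + rho*lambda) * g(lambda) over [a, b]."""
    a = (1.0 - math.sqrt(nr)) ** 2
    b = (1.0 + math.sqrt(nr)) ** 2
    h = (b - a) / steps
    total = 0.0
    for i in range(steps):
        lam = a + (i + 0.5) * h
        total += math.log(1.0 + rho * lam) * eigenvalue_density(lam, nr)
    return total * h


def rate_no_feedback_closed_form(rho, nr):
    """Closed-form value of the same integral, in terms of y and z."""
    u = 1.0 + nr + 1.0 / rho
    root = math.sqrt(u * u - 4.0 * nr)
    y = 0.5 * (u + root)
    z = 0.5 * (u - root)
    return (math.log(rho * y)
            + (1.0 - nr) / nr * math.log(1.0 / (1.0 - z))
            - z / nr)


rho, nr = 10 ** 0.5, 0.5   # 5 dB SNR, N_r/N_t = 0.5 (illustrative values)
print(rate_no_feedback_numeric(rho, nr), rate_no_feedback_closed_form(rho, nr))
```

The two values should agree up to the quadrature error; at high SNR the closed form also approaches $\log\rho - 1 + \frac{1 - \bar{N}_r}{\bar{N}_r}\log\frac{1}{1 - \bar{N}_r}$, which is a second independent check.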



If $\bar{B} = \infty$, then the $K$ columns of the optimal $V$ are the 
eigenvectors of the channel covariance matrix corresponding 
to the $K$ largest eigenvalues. As $(N_t, N_r, B) \to \infty$, we have 

$$ \mathcal{I}_{\mathrm{rvq}}(\bar{B} = \infty) = \int_{\eta}^{\infty} \log\left( 1 + \frac{\rho}{\bar{K}} \lambda \right) g(\lambda)\, d\lambda \tag{33} $$

where $\eta$ satisfies 

$$ \int_{\eta}^{\infty} g(\lambda)\, d\lambda = \min\left\{ 1, \frac{\bar{K}}{\bar{N}_r} \right\} \tag{34} $$

for $\bar{N}_r \le 1$. We emphasize that this corresponds to a uniform 
allocation of power over the set of $K$ active eigenvectors. (This 
result has also been presented in [8].) The rank of the optimal 
$V$, or optimal multiplexing gain, is at most $\min\{N_t, N_r\}$ and 
can be obtained by differentiating (33) with respect to $\bar{K}$. It 
can be verified that $\mathcal{I}_{\mathrm{rvq}}(\bar{B} = 0) \le \mathcal{I}_{\mathrm{rvq}}(\bar{B} = \infty)$. 

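To illustrate the increase in capacity with feedback, in Fig. 4 
we plot the rate ratio $\mathcal{I}_{\mathrm{rvq}}(\bar{B} = \infty)/\mathcal{I}_{\mathrm{rvq}}(\bar{B} = 0)$ versus SNR 
for different values of $\bar{N}_r$, where $\mathcal{I}_{\mathrm{rvq}}(\bar{B} = \infty)$ is optimized 
over rank $\bar{K}$. For large SNR $\rho$, we can expand 

$$ \mathcal{I}_{\mathrm{rvq}}(\bar{B} = 0) = \log(\rho) + o(\log(\rho)) \tag{35} $$

$$ \mathcal{I}_{\mathrm{rvq}}(\bar{B} = \infty) = \log(\rho) \int_{\eta}^{\infty} g(\lambda)\, d\lambda + o(\log(\rho)). \tag{36} $$

Therefore 

$$ \lim_{\rho \to \infty} \frac{\mathcal{I}_{\mathrm{rvq}}(\bar{B} = \infty)}{\mathcal{I}_{\mathrm{rvq}}(\bar{B} = 0)} = \int_{\eta}^{\infty} g(\lambda)\, d\lambda = \min\left\{ 1, \frac{\bar{K}}{\bar{N}_r} \right\} \tag{37} $$

which implies that the optimal rank $K^* = \min\{N_t, N_r\}$, and 
the corresponding asymptotic rate ratio is one. The increase 
in achievable rate from feedback is small in this case, since 
for large SNRs, the transmitter excites all channel modes, and 
the uniform power allocation asymptotically gives the same 
capacity as water pouring. Of course, although the increase in 
rate is small, feedback can simplify coding and decoding. 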

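For small $\rho$, we can expand $\log(1 + \rho\lambda)$ and $\log(1 + \rho\lambda/\bar{K})$ 
in Taylor series. Taking $\rho \to 0$ gives 

$$ \lim_{\rho \to 0} \frac{\mathcal{I}_{\mathrm{rvq}}(\bar{B} = \infty)}{\mathcal{I}_{\mathrm{rvq}}(\bar{B} = 0)} = \frac{1}{\bar{K}} \int_{\eta}^{\infty} \lambda g(\lambda)\, d\lambda \le \frac{1}{\bar{N}_r} \frac{\int_{\eta}^{\infty} \lambda g(\lambda)\, d\lambda}{\int_{\eta}^{\infty} g(\lambda)\, d\lambda} \tag{38} $$

$$ \le \frac{b \int_{\eta}^{\infty} g(\lambda)\, d\lambda}{\bar{N}_r \int_{\eta}^{\infty} g(\lambda)\, d\lambda} = \frac{b}{\bar{N}_r} \tag{39} $$

where $b$ is the asymptotic maximum eigenvalue given by (29). 
The inequality in (38) follows from (34), which implies 
$\bar{K} \ge \bar{N}_r \int_{\eta}^{\infty} g(\lambda)\, d\lambda$. Note that (39) corresponds to allocating all 
transmission power to the strongest channel mode, which is 
known to maximize capacity at low SNRs. The maximal rate 
ratio (39) can also be obtained from Theorem 3. 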

The rate increase due to feedback is substantial when $\bar{N}_r$ 
is small, and the rate ratio tends to infinity as $\bar{N}_r \to 0$. This 
is because the channel becomes a MISO channel, in which 
case the capacity is a constant with $\bar{B} = 0$ and increases as 
$\log(\rho N_t)$ with $\bar{B} = \infty$. 

Fig. 4. The rate ratio $\mathcal{I}_{\mathrm{rvq}}(\bar{B} = \infty)/\mathcal{I}_{\mathrm{rvq}}(\bar{B} = 0)$ versus SNR (dB) for 
various values of $\bar{N}_r$. 



To evaluate the asymptotic capacity with arbitrary $\bar{B}$, we 
approximate $J_j^{N_r}$ given $H$ as a Gaussian random variable. 
This is motivated by the fact that $J_j^{N_r}$ is Gaussian in the 
large system limit [37], since $H V_j$ is i.i.d. Conditioning on $H$ 
introduces dependence among the elements of $H V_j$; however, 
numerical examples indicate that the Gaussian assumption 
is still valid for large $N_t$ and $N_r$. Alternatively, if we do 
not condition on $H$, then the rates $\{J_j^{N_r}\}$ are dependent. 
Application of the results from extreme statistics, assuming 
the rates $\{J_j^{N_r}\}$ are independent, gives an upper bound on the 
asymptotic achievable rate (e.g., see the proof of Theorem 2 
in [21]). This is illustrated by subsequent numerical results. 

Evaluating the large system limit of $I_{\mathrm{rvq}}^{N_r}$, assuming that the 
cdf of $J_j^{N_r}$ is Gaussian, gives the approximate rate 

$$ \tilde{\mathcal{I}}_{\mathrm{rvq}} = \mu_J + \sigma_J \sqrt{2 \bar{B} \log 2} \tag{40} $$

independent of the channel realization, where $\mu_J$ and $\sigma_J^2$ 
are the asymptotic mean of $J_j^{N_r}$, and variance of $N_t J_j^{N_r}$, 
respectively. The derivation of (40) is a straightforward 
extension of [32, Sec. 2.3.2] and is not shown here. As $\bar{B} \to 0$, 
this approximation becomes exact. However, as $\bar{B} \to \infty$, 
the approximate rate $\tilde{\mathcal{I}}_{\mathrm{rvq}} \to \infty$, whereas the actual rate 
$\mathcal{I}_{\mathrm{rvq}}(\bar{B} = \infty)$ is finite, and can be computed from (33) and 
(34). This is because $J_j^{N_r}$ is bounded for all $N_r$, whereas a 
Gaussian random variable can assume arbitrarily large values. 
Therefore the Gaussian approximation gives an inaccurate 
estimate of $\mathcal{I}_{\mathrm{rvq}}$ for large $\bar{B}$. (This implies that we should 
approximate $\mathcal{I}_{\mathrm{rvq}}$ as $\min\{\tilde{\mathcal{I}}_{\mathrm{rvq}}, \mathcal{I}_{\mathrm{rvq}}(\bar{B} = \infty)\}$.) 
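
The capped Gaussian approximation described above is simple to evaluate once the mean, the standard deviation, and the unlimited-feedback rate are available. The sketch below is a hedged illustration: the function name `gaussian_approx_rate` and the numerical inputs are hypothetical placeholders (they are not values from the paper), and natural logarithms are assumed.

```python
import math


def gaussian_approx_rate(mu_j, sigma_j, b_bar, rate_unlimited=None):
    """Approximate RVQ rate mu_J + sigma_J * sqrt(2 * b_bar * log 2),
    optionally capped by the unlimited-feedback rate.

    mu_j, sigma_j : asymptotic mean and standard deviation (inputs assumed
                    to be computed or simulated elsewhere)
    b_bar         : normalized feedback B / N_t^2
    """
    if b_bar < 0:
        raise ValueError("normalized feedback must be non-negative")
    approx = mu_j + sigma_j * math.sqrt(2.0 * b_bar * math.log(2))
    if rate_unlimited is not None:
        # The Gaussian tail is unbounded, so cap at the finite true limit.
        approx = min(approx, rate_unlimited)
    return approx


# Placeholder inputs for illustration only.
mu_j, sigma_j, rate_inf = 1.0, 0.5, 1.6
for b_bar in (0.0, 0.25, 0.5, 1.0, 2.0):
    print(b_bar, gaussian_approx_rate(mu_j, sigma_j, b_bar, rate_inf))
```

With zero feedback the approximation returns the mean, and for large normalized feedback the cap takes over, which matches the qualitative behavior described in the text.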
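The asymptotic mean and variance of $J_j^{N_r}$ are computed in 
Appendix D. The asymptotic mean is given by 

$$ \mu_J = \frac{\bar{K}}{\bar{N}_r} \log\left( 1 + \frac{\bar{N}_r}{\bar{K}} \rho - \frac{\bar{N}_r}{\bar{K}} \rho v \right) + \log\left( 1 + \rho - \frac{\bar{N}_r}{\bar{K}} \rho v \right) - v \tag{41} $$

where 

$$ v = \frac{1}{2} + \frac{\bar{K}}{2\bar{N}_r} + \frac{\bar{K}}{2\bar{N}_r \rho} - \frac{1}{2} \sqrt{ \left( 1 + \frac{\bar{K}}{\bar{N}_r} + \frac{\bar{K}}{\bar{N}_r \rho} \right)^2 - \frac{4\bar{K}}{\bar{N}_r} }. \tag{42} $$
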



The asymptotic variance is approximated for $0 \le \bar{K} = \bar{N}_r \le 1$ 
and small SNR ($\rho \le -5$ dB) as 

$$ \sigma_J^2 \approx \rho^2 (1 - \bar{N}_r). \tag{43} $$

The asymptotic variance for moderate SNRs and normalized 
rank $\bar{K} \ne \bar{N}_r$ can be computed easily via numerical 
simulation.$^1$ 

In contrast with the beamforming results in the preceding 
section, we are unable to show that RVQ is asymptotically 
optimal when the precoding matrix has arbitrary rank. The 
corresponding argument for beamforming relies on the 
evaluation of the asymptotic rate difference $\mathcal{I}^{\Delta}_{\mathrm{rvq}}$. Since here we 
are unable to evaluate $\mathcal{I}_{\mathrm{rvq}}$ exactly, we cannot apply that 
argument. Nevertheless, numerical results have indicated that 
the performance of RVQ matches that of optimized codebooks 
(e.g., see [22]). 

Fig. 5 shows $\mathcal{I}_{\mathrm{rvq}}$ with normalized rank $\bar{K} = \bar{N}_r$ versus 
$\bar{B}$ for $\rho = -5, 0, 5$ dB and $\bar{N}_r = 0.5$. The dashed lines 
show the unlimited feedback capacity $\mathcal{I}_{\mathrm{rvq}}(\bar{B} = \infty)$, which is 
computed from (33) with optimized $\bar{K}$. The asymptotic rate 
with RVQ is computed from (40), where $\sigma_J$ for $\rho = -5$ dB is 
approximated by (43), and $\sigma_J$ is determined from simulation 
with $N_t = 20$ for $\rho = 0$ and 5 dB. Also shown in Fig. 5 are 
simulation results for $\mathcal{I}_{\mathrm{rvq}}$ with $N_t = 8$ and $N_r = 4$. Because 
the size of the RVQ codebook increases exponentially with 
$B$, it is difficult to generate simulation results for moderate to 
large values of $B$. Hence simulation results are shown only 
for $\bar{B} < 0.8$. The asymptotic results accurately approximate 
the simulated results shown. The accuracy increases as the 
feedback $\bar{B}$ decreases. 

Since $\mathcal{I}_{\mathrm{rvq}}$ is a function of both rank $\bar{K}$ and feedback $\bar{B}$, for 
a given $\bar{B}$, we can select $\bar{K}$ to maximize $\mathcal{I}_{\mathrm{rvq}}$. Fig. 6 shows 
mutual information per receive antenna versus normalized 
rank from (40) with $\bar{N}_r = 0.2$, $\rho = 5$ dB, and different 
values of $\bar{B}$. ($\sigma_J$ is obtained from numerical simulations.) 
The maximal rates are attained at $\bar{K} = 1$, 0.3, and 0.2 for 
$\bar{B} = 0$, 0.5, and 2, respectively. In general, the optimal rank 
is approximately $\bar{N}_r$ for large enough $\bar{B}$ and SNR. The results 
in Fig. 6 indicate that taking $\bar{K} = \bar{N}_r$ achieves near-optimal 
performance, independent of $\bar{B}$ when $\bar{B} > 0$. As $\bar{B}$ increases, 
the rate increases and the difference between the rate with 
optimized rank and full-rank ($\bar{K} = 1$) also increases. For the 
example shown, the rate increase from selecting the optimal 
rank is as high as 50% when $\bar{B} = 2$. 

$^1$We note that the simulation needed to compute this variance is much 
simpler than the simulation that would be required to obtain the RVQ rate 
directly, especially with a moderate to large number of feedback bits. 

Fig. 5. Sum mutual information per receive antenna with RVQ and an optimal 
receiver versus normalized feedback. The asymptotic approximation is shown 
along with Monte Carlo simulation results for an $8 \times 4$ channel. 

Fig. 6. Mutual information per receive antenna versus normalized rank with 
different normalized feedback. Discrete points correspond to simulation with 
$N_t = 10$. (Panel title: $N_r/N_t = 0.2$; $N_t = 10$; SNR = 5 dB.) 

V. Quantized Precoding with Linear Receivers 

In this section we evaluate the performance of a quantized
precoding matrix with linear receivers (matched filter and
MMSE), and compare with the performance of the optimal
receiver. As $\bar{B} \to \infty$, the optimal precoding matrix
eliminates the cross-coupling among channel modes, and the
optimal receiver becomes the linear matched filter. Hence the
corresponding achievable rates should be the same in this
limit. However, for finite $\bar{B}$ the optimal receiver is expected
to perform better than the linear receiver. Given a target
rate, increasing the feedback therefore enables a reduction in
receiver complexity.

We again assume that there are $K$ independent data streams,
which are multiplexed by the linear precoder onto $N_t$ transmit
antennas. To detect the transmitted symbols in data stream $k$,
the received signal $y$ is passed through the $N_r \times 1$ receive
filter $c_k$. The matched filter is given by

$$c_k = \frac{1}{\sqrt{K}} H v_k \tag{44}$$



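where $v_k$ is the $k$th column of the precoding matrix $V$, and
the linear MMSE filter is given by

$$c_k = \left( \frac{1}{K} H V V^\dagger H^\dagger + \sigma_n^2 I_{N_r} \right)^{-1} \frac{1}{\sqrt{K}} H v_k. \tag{45}$$

The SINR at the output of the linear filter $c_k$ is

$$\mathrm{SINR}_k = \frac{|c_k^\dagger H v_k|^2}{c_k^\dagger \left( \sum_{i \neq k} H v_i v_i^\dagger H^\dagger + K \sigma_n^2 I_{N_r} \right) c_k}. \tag{46}$$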

Of course, the interference among data streams can signifi- 
cantly decrease the channel capacity. 

The performance measure is again mutual information between
the transmitted symbol $x_k$ and the output of the filter
$c_k$, denoted by $\hat{x}_k$. In what follows, we assume independent
coders and decoders for each data stream. Assuming that the
interference plus noise at the output of the linear filter has a
Gaussian distribution, which is true in the large system limit to
be considered, the sum mutual information of all data streams
per receive antenna is given by

$$R^{N_r} = \frac{1}{N_r} \sum_{k=1}^{K} I(x_k; \hat{x}_k) \tag{47}$$

$$= \frac{1}{N_r} \sum_{k=1}^{K} \log(1 + \gamma_k) \tag{48}$$

where $\gamma_k$ is the SINR for the $k$th data stream. Given a channel
matrix $H$, the sum rate $R^{N_r}$ depends on the precoding matrix
$V$. We are interested in maximizing $R^{N_r}$ subject to the power
constraint $\|v_k\| \le 1$, $\forall k$, assuming that the power is allocated
equally across streams.

Given the codebook of precoding matrices $\mathcal{V} = \{V_j, 1 \le j \le 2^B\}$,
the receiver selects the precoding matrix

$$\hat{V} = \arg\max_{1 \le j \le 2^B} R^{N_r}(V_j). \tag{49}$$

We again consider RVQ, in which the $V_j$'s are i.i.d. unitary
matrices.

A. Matched filter 

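Substituting (44) into (46), the SINR at the output of the
matched filter is given by

$$\gamma_{k;\mathrm{mf}} = \frac{(v_k^\dagger H^\dagger H v_k)^2}{K \sigma_n^2 \, (v_k^\dagger H^\dagger H v_k) + \sum_{i \neq k} |v_k^\dagger H^\dagger H v_i|^2} \tag{50}$$

where subscript $k$ denotes the $k$th data stream. The average
sum rate per receive antenna is given by

$$E_{H,\mathcal{V}} \left[ \max_{1 \le j \le 2^B} \frac{1}{N_r} \sum_{k=1}^{K} \log(1 + \gamma_{k;\mathrm{mf}}) \right] \tag{51}$$

where the expectation is over the channel matrix and codebook.
Since the pdf of the matched-filter sum rate $R^{N_r}_{\mathrm{mf}}$ is unknown for finite $(N_t, N_r, K)$,
we are unable to evaluate (51). Motivated by the central limit
theorem,² in what follows we approximate the cdf of $R^{N_r}_{\mathrm{mf}}$ as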

2 The terms in the sum in (51) are not i.i.d., which prevents a direct
application of the central limit theorem.



[Figure 7 plot. Panel title: PDF for $R_{\mathrm{mf};N_r}$; $N_r = 10$; $N_r/N_t = 1$; $K/N_r = 0.3$; SNR = 5 dB. Legend: empirical pdf; Gaussian approximation.]

Fig. 7. Comparison of the empirical pdf for $R_{\mathrm{mf};N_r}$ with the Gaussian
approximation.



Gaussian. The mean is taken to be the asymptotic limit

$$\mu_{\mathrm{mf}} = \lim_{(N_t, N_r, K) \to \infty} R^{N_r}_{\mathrm{mf}} = \frac{\bar{K}}{\bar{N}_r} \log\left( 1 + \frac{\bar{N}_r}{\bar{K}(1 + \sigma_n^2)} \right). \tag{52}$$

This limit follows from the fact that $\gamma_{k;\mathrm{mf}}$ converges almost
surely to $[\bar{K}(1 + \sigma_n^2)/\bar{N}_r]^{-1}$ as $(N_t, N_r, K) \to \infty$ with fixed
$\bar{N}_r$ and $\bar{K}$. As for the optimal receiver,

$$N_r^2 \, \mathrm{var}[R^{N_r}_{\mathrm{mf}} \mid V_j] \to \sigma^2_{\mathrm{mf}} \tag{53}$$

where $\sigma^2_{\mathrm{mf}}$ can be easily obtained by numerical simulations.

The accuracy of the Gaussian approximation for $R^{N_r}_{\mathrm{mf}}$ is
illustrated in Fig. 7, which compares the empirical pdf with the
Gaussian approximation for $N_r = 10$, $\bar{N}_r = 1$, $K/N_r = 0.3$
and SNR = 5 dB. The difference between the empirical and
asymptotic means vanishes as $(N_t, N_r, K) \to \infty$.
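The comparison between the finite-size sum rate and the limit (52), and the estimate of the variance in (53), can be illustrated with a short Monte Carlo sketch. It is not the authors' simulation code. It assumes natural logarithms, and it models the effective channel vectors $Hv_k$ directly as i.i.d. circularly symmetric Gaussian vectors, which holds in distribution when $V$ has orthonormal columns independent of $H$. All function names are hypothetical.

```python
import math
import random


def complex_gaussian_vector(n, rng):
    """n i.i.d. circularly symmetric Gaussian entries, zero mean, unit variance."""
    s = math.sqrt(0.5)
    return [complex(rng.gauss(0.0, s), rng.gauss(0.0, s)) for _ in range(n)]


def inner(a, b):
    """Inner product a^dagger b."""
    return sum(x.conjugate() * y for x, y in zip(a, b))


def mf_sum_rate(h, noise_var):
    """Sum rate per receive antenna, (48), with the matched-filter SINR (50).

    h[k] plays the role of H v_k (an assumption of this sketch).
    """
    K, Nr = len(h), len(h[0])
    total = 0.0
    for k in range(K):
        g = inner(h[k], h[k]).real
        interference = sum(abs(inner(h[k], h[i])) ** 2 for i in range(K) if i != k)
        sinr = g * g / (K * noise_var * g + interference)
        total += math.log(1.0 + sinr)
    return total / Nr


def mu_mf(K_bar, Nr_bar, noise_var):
    """Asymptotic mean from (52)."""
    return (K_bar / Nr_bar) * math.log(1.0 + Nr_bar / (K_bar * (1.0 + noise_var)))


def simulate(Nr, K, noise_var, trials=200, seed=1):
    """Return the empirical mean of the sum rate and N_r^2 times its variance."""
    rng = random.Random(seed)
    rates = []
    for _ in range(trials):
        h = [complex_gaussian_vector(Nr, rng) for _ in range(K)]
        rates.append(mf_sum_rate(h, noise_var))
    mean = sum(rates) / trials
    var = sum((r - mean) ** 2 for r in rates) / (trials - 1)
    return mean, Nr * Nr * var


if __name__ == "__main__":
    noise_var = 10 ** (-0.5)  # SNR = 5 dB
    print(simulate(20, 6, noise_var), mu_mf(0.3, 1.0, noise_var))
```

Increasing `Nr` and `K` with a fixed ratio should bring the empirical mean closer to `mu_mf`, at the cost of a longer run time in pure Python.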

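We wish to apply the theory of extreme order statistics [32]
to evaluate the large system limit

$$\mathcal{R}_{\mathrm{rvq;mf}} = \lim_{(N_t, N_r, K, B) \to \infty} E\left[ \max_{1 \le j \le 2^B} R^{N_r}_{\mathrm{mf}}(V_j) \,\middle|\, \mathcal{V} \right]. \tag{54}$$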



Given $\mathcal{V}$, the sum rates $\{R^{N_r}_{\mathrm{mf}}(V_1), \ldots, R^{N_r}_{\mathrm{mf}}(V_{2^B})\}$ are identically
distributed. However, the $R^{N_r}_{\mathrm{mf}}(V_j)$'s are not independent
since each depends on $H$. This makes an exact calculation
of the asymptotic rate difficult. Nevertheless, for a small
number of entries in the codebook (small $B$), assuming that
the rates for a given codebook are independent leads to
an accurate approximation. We therefore replace the rates
$R^{N_r}_{\mathrm{mf}}(V_j)$, $j = 1, \ldots, 2^B$, with i.i.d. Gaussian variables with
mean $\mu_{\mathrm{mf}}$ and variance $\sigma^2_{\mathrm{mf}}/N_r^2$. In analogy with the analysis
of the optimal receiver in the preceding section, this gives the
approximate asymptotic rate

$$\tilde{\mathcal{R}}_{\mathrm{rvq;mf}} = \mu_{\mathrm{mf}} + \sigma_{\mathrm{mf}} \sqrt{2 \bar{B} \log 2}. \tag{55}$$

Numerical results, to be presented, show that this asymptotic
approximation is very accurate for small to moderate values
of normalized feedback $\bar{B}$. As $\bar{B} \to 0$, this approximation
becomes exact. However, as $\bar{B} \to \infty$, $\tilde{\mathcal{R}}_{\mathrm{rvq;mf}} \to \infty$, whereas
$\mathcal{R}_{\mathrm{rvq;mf}}$ with $\bar{B} = \infty$ is the same as the asymptotic rate with
RVQ and an optimal receiver, given by (33) and (34). Hence
$\mathcal{R}_{\mathrm{rvq;mf}}$ with $\bar{B} = \infty$ is finite. As for the analysis of the optimal
receiver, this discrepancy is again due to the fact that the cdf
of $R^{N_r}_{\mathrm{mf}}$, which has compact support, is being approximated
by a Gaussian cdf with infinite support, and also because the
dependence among the sum rates $R^{N_r}_{\mathrm{mf}}(V_j)$ is being ignored.
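As a small worked illustration of (55), the sketch below evaluates the approximate rate for a given mean, standard deviation, and normalized feedback, and inverts the same expression to estimate the feedback needed for a target rate. The function names are hypothetical, and the numerical value used for $\sigma_{\mathrm{mf}}$ in the usage lines is a placeholder, not a value taken from the paper.

```python
import math


def rvq_rate_approx(mu, sigma, b_bar):
    """Approximate asymptotic rate mu + sigma * sqrt(2 * b_bar * log 2), as in (55)."""
    if b_bar < 0:
        raise ValueError("normalized feedback must be non-negative")
    return mu + sigma * math.sqrt(2.0 * b_bar * math.log(2.0))


def feedback_for_target(mu, sigma, target):
    """Normalized feedback at which the approximation (55) reaches `target`.

    This is only an algebraic inversion of (55). It inherits the limitation
    noted in the text: the approximation grows without bound as feedback grows.
    """
    if target <= mu:
        return 0.0
    return ((target - mu) / sigma) ** 2 / (2.0 * math.log(2.0))


if __name__ == "__main__":
    mu, sigma = 0.7, 0.5  # placeholder values for illustration only
    for b_bar in (0.0, 0.2, 0.5, 1.0):
        print(b_bar, rvq_rate_approx(mu, sigma, b_bar))
    print(feedback_for_target(mu, sigma, 1.0))
```

Because (55) is unbounded in $\bar{B}$, any value returned by `feedback_for_target` for a target above the unlimited-feedback rate is not meaningful.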



B. MMSE receiver 

Substituting (45) into (46) gives the SINR at the output of the
MMSE receiver for the $k$th symbol stream

$$\gamma_{k;\mathrm{mmse}} = v_k^\dagger H^\dagger \left( \sum_{i \neq k} H v_i v_i^\dagger H^\dagger + K \sigma_n^2 I_{N_r} \right)^{-1} H v_k. \tag{56}$$

As for the matched filter receiver, given a codebook $\mathcal{V}$, we
approximate the pdf of the instantaneous sum rate

$$R^{N_r}_{\mathrm{mmse}} = \frac{1}{N_r} \sum_{k=1}^{K} \log(1 + \gamma_{k;\mathrm{mmse}}) \tag{57}$$

as a Gaussian pdf with mean

$$\mu_{\mathrm{mmse}} = \lim_{(N_t, N_r) \to \infty} R^{N_r}_{\mathrm{mmse}} = \frac{\bar{K}}{\bar{N}_r} \log(1 + \gamma_{\mathrm{mmse}}) \tag{58}$$

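where the large system SINR is given by [30]

$$\gamma_{\mathrm{mmse}} = \frac{1 - \bar{K}/\bar{N}_r}{2 \sigma_n^2} - \frac{1}{2} + \sqrt{ \frac{(1 - \bar{K}/\bar{N}_r)^2}{4 \sigma_n^4} + \frac{1 + \bar{K}/\bar{N}_r}{2 \sigma_n^2} + \frac{1}{4} }. \tag{59}$$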

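As for the matched filter, the asymptotic variance $\sigma^2_{\mathrm{mmse}}$ can
be obtained via numerical simulation.

In analogy with (55), the asymptotic rate with RVQ and the
MMSE receiver is given by

$$\tilde{\mathcal{R}}_{\mathrm{rvq;mmse}} = \mu_{\mathrm{mmse}} + \sigma_{\mathrm{mmse}} \sqrt{2 \bar{B} \log 2}. \tag{60}$$

As for the matched filter receiver, when $\bar{B}$ is large, $\tilde{\mathcal{R}}_{\mathrm{rvq;mmse}}$
over-estimates $\mathcal{R}_{\mathrm{rvq;mmse}}$. For $\bar{B} = \infty$, $\mathcal{R}_{\mathrm{rvq;mmse}} = \mathcal{R}_{\mathrm{rvq;mf}} = \mathcal{I}_{\mathrm{rvq}}$
with the optimal receiver, given by (33).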

C. Numerical Results 

Fig. 8 compares the approximation for asymptotic RVQ
performance with a matched filter receiver from (55) with
simulated results for $N_t = 12$, $\bar{N}_r = 0.75$, $K/N_t = 1/2$, and
SNR = 5 dB. Also shown for comparison are the asymptotic
rate for RVQ with an optimal receiver, derived in Section IV,
the water-filling capacity ($\bar{B} = \infty$), and the rate achieved with
a scalar quantizer for each coefficient.³ For the case shown,
the analytical approximation gives an accurate estimate of the
performance of the finite size system with limited feedback.
The capacity with the water-filling power allocation is only
slightly greater than that achieved with the on-off power
allocation. The optimal receiver requires $\bar{B} \approx 0.6$ bit/dimension

3 For the scalar quantization results the available bits are spread evenly 
over the corresponding fraction of precoding coefficients. The remaining 
coefficients are set to one. 



[Figure 8 plot. Curves: water-filling capacity (large system); RVQ with optimal receiver (large system); RVQ with MF receiver (large system); RVQ with MF receiver ($N_t = 12$); scalar quantization ($N_t = 12$). Horizontal axis: normalized feedback bits, 0 to 1.8.]



Fig. 8. Sum rate per receive antenna versus normalized feedback bits with a 
matched filter receiver. Results are shown for RVQ (asymptotic and Nt = 12) 
and scalar quantization. Also shown are results for the optimal receiver, and 
the water-filling capacity with infinite feedback. 



[Figure 9 panel title: $N_r/N_t = 0.75$; $K/N_t = 0.5$; SNR = 5 dB]




Fig. 9. Sum rate per receive antenna versus normalized feedback bits with a 
linear MMSE receiver. Results are shown for RVQ (asymptotic and Nt = 12) 
and scalar quantization. Also shown are results for the optimal receiver, and 
the water-filling capacity with infinite feedback. 



to achieve the capacity corresponding to unlimited feedback
(33), whereas the matched filter requires 1.2 feedback bits per
dimension to reach that capacity. For other target rates, these
curves illustrate the trade-off between feedback and receiver
complexity.

Fig. 9 shows the same set of results as those shown in Fig. 8,
but with an MMSE receiver. These results show that for the
parameters selected, the MMSE receiver performs nearly as
well as the optimal receiver, and requires substantially less
feedback than the matched filter to achieve a target rate.
Again the asymptotic approximation accurately predicts the
performance of a system with a relatively small number of
antennas.

VI. Conclusions 

We have studied the capacity of single-user MISO and 
MIMO fading channels with limited feedback. The feedback
specifies a transmit precoding matrix, which can be optimized
for a given channel realization. We first considered the perfor- 
mance with a rank-one precoding matrix (beamformer), and 
showed that the RVQ codebook is asymptotically optimal. 
Exact expressions for the asymptotic mutual information for 
MISO and MIMO channels were presented, and reveal how 
much feedback is required to achieve a desired performance. 
For the cases considered, one feedback bit for each precoder 
coefficient can achieve close to the water-filling capacity. 
Perhaps more important than the increase in capacity provided 
by this feedback is the associated simplification in the coding 
and decoding schemes that can achieve a rate close to capacity. 

The performance of a precoding matrix with rank K > 1 
was also evaluated with RVQ. Although numerical examples 
and our beamforming results (K = 1) suggest that RVQ is 
also asymptotically optimal in this case, proving this is an 
open problem. To compute the asymptotic achievable rate for 
RVQ with both optimal and linear receivers, the achievable 
rate with a random channel and fixed precoding matrix is 
approximated as a Gaussian random variable. The asymptotic 
rate then depends on the asymptotic mean and variance of this 
random variable. Although the asymptotic variance appears to 
be difficult to compute analytically, it can be easily obtained 
by simulation. Numerical results have shown that the resulting 
approximation accurately estimates the achievable rate with 
limited feedback for finite-size systems of interest. 

Numerical examples comparing the performance of opti- 
mal and linear receivers have shown that the linear MMSE 
receiver requires little additional feedback, relative to the 
optimal receiver, to achieve a target rate close to the water- 
filling capacity. The matched filter requires significantly more 
feedback than the MMSE receiver (more than 0.5 bit per 
degree of freedom for the cases shown). At low feedback rates 
the achievable rate with RVQ is generally much greater than 
that associated with scalar quantization. Of course, this comes 
at a price of high complexity, since the receiver is assumed 
to compute the performance metric for every entry in the 
codebook. Other reduced complexity schemes for quantizing 
a beamforming vector are presented in [21], [38], [39]. 

Key assumptions for our results are that the channel is
stationary and known at the receiver, and that the channel
elements are i.i.d. Depending on user mobility and associated
Doppler shifts, the channel may change too fast to allow reliable
channel estimation and feedback. In that case, feedback of
channel statistics, as proposed in [15]-[18], [40], can exploit
correlation among channel elements. The design of quantization
codebooks for precoders, which takes correlation into
account, is addressed in [40]. The effect of channel estimation
error on the performance of limited feedback beamforming 
with finite coherence time (i.e., block fading) is presented in 
[41], [42]. 

We have also assumed that the channel gains are not 
frequency-selective. Limited feedback schemes for frequency- 
selective scalar channels are discussed in [43], and could 
be combined with the quantization schemes considered here. 
Finally, the approach presented here for a single-user MIMO
channel can also be applied to multi-user models. Quantization
of beamformers for the MIMO downlink has been considered
in [26]-[28]. In that scenario the potential capacity gain due
to feedback is generally much more than for the single-user 
channel considered here. The benefits of limited feedback for 
related models (e.g., frequency-selective MIMO downlink) are 
currently being studied. 



Appendix 



A. Proof of Theorem 1



Given $h$, the receiver selects the quantized beamforming
vector $\hat{v}$ to maximize the instantaneous rate in (3). Since $\log$
is monotonically increasing, the quantized beamforming vector
is given by

$$\hat{v} = \arg\max_{1 \le j \le 2^B} \left\{ Y_j = |h^\dagger v_j|^2 / \|h\|^2 \right\}. \tag{61}$$

Since the codebook entries $v_j$, $j = 1, \ldots, 2^B$, are i.i.d., the
$Y_j$'s, given $h$, are also i.i.d. with the cdf [25]

$$F_{Y|h}(y) = 1 - (1 - y)^{N_t - 1}, \quad 0 \le y \le 1. \tag{62}$$



We wish to determine the distribution of $\max_j Y_j$ given $h$.
From [32, Theorem 2.1.2] it follows that

$$\frac{\max_j Y_j - a_n}{b_n} \xrightarrow{\mathcal{D}} \mathcal{Y} \tag{63}$$

where $\mathcal{Y}$ is a Weibull random variable having distribution

$$H_{\gamma}(x) = \begin{cases} 1, & x \ge 0 \\ \exp\{-(-x)^{\gamma}\}, & x < 0, \end{cases} \tag{64}$$

$\xrightarrow{\mathcal{D}}$ denotes convergence in distribution, and $a_n$ and $b_n$ are
normalizing sequences, where $n = 2^B$. Specifically, the theorem
requires that $\omega(F_{Y|h}) = \sup\{y : F_{Y|h}(y) < 1\}$ be finite,
and that the distribution function $F^*_{Y|h}(y) = F_{Y|h}(\omega(F_{Y|h}) - 1/y)$,
$y > 0$, satisfies, for all $y > 0$,

$$\lim_{t \to \infty} \frac{1 - F^*_{Y|h}(ty)}{1 - F^*_{Y|h}(t)} = y^{-\gamma} \tag{65}$$

where the constant $\gamma > 0$.

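Substituting the expression for $F_{Y|h}$ in (62) into (65), where
$\omega(F_{Y|h}) = 1$, gives

$$\lim_{t \to \infty} \frac{1 - F^*_{Y|h}(ty)}{1 - F^*_{Y|h}(t)} = \lim_{t \to \infty} \frac{(ty)^{-(N_t - 1)}}{t^{-(N_t - 1)}} \tag{66}$$

$$= y^{-(N_t - 1)} \tag{67}$$

so that [32, Theorem 2.1.2] applies with $\gamma = N_t - 1$ when $N_t > 1$. Furthermore,
the normalizing constants are given by

$$a_n = \omega(F_{Y|h}) = 1 \tag{68}$$

and

$$b_n = \omega(F_{Y|h}) - \inf\left\{ y : 1 - F_{Y|h}(y) \le \frac{1}{n} \right\} \tag{69}$$

$$= \left( \frac{1}{n} \right)^{\frac{1}{N_t - 1}}. \tag{70}$$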



SANTIPACH AND HONIG: CAPACITY OF A MULTIPLE-ANTENNA FADING CHANNEL WITH A QUANTIZED PRECODING MATRIX 






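To take the limit as $N_t \to \infty$, we will assume that the
channel vector $h$ contains the first $N_t$ elements of an infinite-length
i.i.d. complex Gaussian vector $\bar{h}$. Rearranging terms in (63)
and taking the large system limit gives

$$\lim_{(N_t, n) \to \infty} E\left[ \max_j Y_j \,\middle|\, \bar{h} \right] \tag{71}$$

$$= \lim_{(N_t, n) \to \infty} a_n + b_n E[\mathcal{Y}] \tag{72}$$

$$= 1 - \lim_{(N_t, n) \to \infty} \left( \frac{1}{n} \right)^{\frac{1}{N_t - 1}} \Gamma\left( 1 + \frac{1}{N_t - 1} \right) \tag{73}$$

$$= 1 - \lim_{(N_t, B) \to \infty} 2^{-\frac{B}{N_t - 1}} \tag{74}$$

$$= 1 - 2^{-\bar{B}} \tag{75}$$

where the gamma function $\Gamma(z) = \int_0^\infty t^{z-1} e^{-t} \, dt$ and we
have used the fact that $E[\mathcal{Y}] = -\Gamma(1 + 1/(N_t - 1))$ [44].
From (63), as $(n, N_t) \to \infty$,

$$\mathrm{var}\left[ \max_j Y_j \,\middle|\, \bar{h} \right] - b_n^2 \, \mathrm{var}[\mathcal{Y}] \to 0. \tag{76}$$

Since $\mathrm{var}[\mathcal{Y}] = \Gamma(1 + 2/(N_t - 1)) - \Gamma^2(1 + 1/(N_t - 1)) \to 0$, it
follows that $\mathrm{var}[\max_j Y_j \mid \bar{h}] \to 0$. This establishes that given
$\bar{h}$,

$$\max_j Y_j \to 1 - 2^{-\bar{B}} \tag{77}$$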

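in the mean square sense. The asymptotic rate difference is
given by

$$\mathcal{I}^{\triangle}_{\mathrm{rvq}} = \lim_{(N_t, B) \to \infty} \left\{ \log\left( 1 + \rho \|h\|^2 \max_{1 \le j \le 2^B} Y_j \right) - \log(\rho N_t) \right\} \tag{78}$$

$$= \lim_{(N_t, B) \to \infty} \log\left( \frac{1}{\rho N_t} + \frac{1}{N_t} \|h\|^2 \max_{1 \le j \le 2^B} Y_j \right) \tag{79}$$

$$= \log(1 - 2^{-\bar{B}}) \tag{80}$$

in the mean square sense, since $\|h\|^2 / N_t \to 1$ almost surely.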
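The convergence of $\max_j Y_j$ can be checked numerically with a short sketch. It relies on the fact that the cdf in (62) is that of a Beta(1, $N_t - 1$) random variable, so the $Y_j$ can be sampled directly without generating a codebook. The function names are hypothetical, and the finite-size expression is obtained by combining (68), (70) and the mean of the Weibull limit, as an approximation rather than an exact formula.

```python
import math
import random


def simulated_mean_max(Nt, B, trials=2000, seed=0):
    """Monte Carlo estimate of E[max_j Y_j] over a codebook of 2^B entries."""
    rng = random.Random(seed)
    n = 2 ** B
    total = 0.0
    for _ in range(trials):
        # Each Y_j has cdf 1 - (1 - y)^(Nt - 1), i.e. it is Beta(1, Nt - 1).
        total += max(rng.betavariate(1, Nt - 1) for _ in range(n))
    return total / trials


def finite_size_approx(Nt, B):
    """a_n + b_n E[Y]: 1 - 2^(-B/(Nt-1)) * Gamma(1 + 1/(Nt-1))."""
    g = 1.0 / (Nt - 1)
    return 1.0 - 2.0 ** (-B * g) * math.gamma(1.0 + g)


def large_system_limit(b_bar):
    """Limit 1 - 2^(-b_bar) from (75) and (77), with b_bar = B / Nt."""
    return 1.0 - 2.0 ** (-b_bar)


if __name__ == "__main__":
    Nt, B = 8, 6
    print(simulated_mean_max(Nt, B), finite_size_approx(Nt, B), large_system_limit(B / Nt))
```

For small $N_t$ the simulated mean tracks the finite-size expression more closely than the large system limit, since the limit replaces $B/(N_t - 1)$ by $\bar{B}$ and the gamma factor by one.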

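B. Proof of Theorem 2

The rate difference associated with codebook $\mathcal{V}_{N_t}$ is

$$I^{\triangle}_{\mathcal{V}_{N_t}} = \max_{1 \le j \le 2^B} \log\left( 1 + \rho |h^\dagger v_j^{N_t}|^2 \right) - \log(\rho N_t) \tag{81}$$

$$= \log\left( \frac{1}{\rho N_t} + \max_{1 \le j \le 2^B} \frac{1}{N_t} |h^\dagger v_j^{N_t}|^2 \right). \tag{82}$$

Taking expectation of the rate difference with respect to $h$ and
applying Jensen's inequality, we obtain

$$E_h[I^{\triangle}_{\mathcal{V}_{N_t}}] \le \log\left( \frac{1}{\rho N_t} + \frac{1}{N_t} E_h\left[ |h^\dagger \hat{v}^{N_t}|^2 \right] \right) \tag{83}$$

$$= \log\left( \frac{1}{\rho N_t} + E\left[ \frac{1}{N_t} \|h\|^2 \right] E[\mu] \right) \tag{84}$$

where the optimal beamforming vector

$$\hat{v}^{N_t} = \arg\max_{1 \le j \le 2^B} |h^\dagger v_j^{N_t}|^2, \tag{85}$$

$\mu = |h^\dagger \hat{v}^{N_t}|^2 / \|h\|^2$, and (84) follows from the fact that $\|h\|^2$
and $\mu$ are independent [25].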



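We now derive an upper bound for $E[|h^\dagger \hat{v}^{N_t}|^2 / \|h\|^2]$.
From (30) in [2] we have

$$\Pr\left\{ |h^\dagger \hat{v}^{N_t}|^2 > \gamma s \,\middle|\, \|h\|^2 = \gamma \right\} \le \begin{cases} 1, & 0 \le s \le s^* \\ 2^B (1 - s)^{N_t - 1}, & s^* < s \le 1 \end{cases} \tag{86}$$

where

$$s^* = 1 - 2^{-\frac{B}{N_t - 1}}. \tag{87}$$

Since the right-hand side of (86) is independent of $\gamma$, averaging
over $\gamma$ gives

$$\Pr\{\mu > s\} \le \begin{cases} 1, & 0 \le s \le s^* \\ 2^B (1 - s)^{N_t - 1}, & s^* < s \le 1. \end{cases} \tag{88}$$

Integrating by parts, we have that

$$E[\mu] = \int_0^1 \Pr\{\mu > x\} \, dx \tag{89}$$

$$= \int_0^{s^*} \Pr\{\mu > x\} \, dx + \int_{s^*}^1 \Pr\{\mu > x\} \, dx. \tag{90}$$

Substituting (88) into (90) and evaluating both integrals gives

$$E[\mu] \le 1 - 2^{-\frac{B}{N_t - 1}} + \frac{1}{N_t} 2^{-\frac{B}{N_t - 1}}. \tag{91}$$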
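As a check on (91), the two integrals in (90) can be evaluated explicitly from the bound (88). The short derivation below uses only $s^*$ from (87) and is included as a worked step, not as part of the original proof.

```latex
\begin{align*}
\int_0^{s^*} \Pr\{\mu > x\}\,dx
  &\le \int_0^{s^*} 1\,dx = s^* = 1 - 2^{-\frac{B}{N_t-1}}, \\
\int_{s^*}^{1} \Pr\{\mu > x\}\,dx
  &\le 2^B \int_{s^*}^{1} (1-x)^{N_t-1}\,dx
   = \frac{2^B (1-s^*)^{N_t}}{N_t}
   = \frac{2^B\, 2^{-\frac{B N_t}{N_t-1}}}{N_t}
   = \frac{1}{N_t}\, 2^{-\frac{B}{N_t-1}}.
\end{align*}
```

Adding the two bounds reproduces the right-hand side of (91), since $B - B N_t/(N_t - 1) = -B/(N_t - 1)$.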

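Substituting $E[\|h\|^2] = N_t$ and (91) into (84) gives

$$E[I^{\triangle}_{\mathcal{V}_{N_t}}] \le \log\left( 1 - 2^{-\frac{B}{N_t - 1}} + \frac{1}{N_t} 2^{-\frac{B}{N_t - 1}} + \frac{1}{\rho N_t} \right) \tag{92}$$

and taking the large system limit gives

$$\lim_{(N_t, B) \to \infty} E[I^{\triangle}_{\mathcal{V}_{N_t}}] \le \log(1 - 2^{-\bar{B}}). \tag{93}$$

Theorem 1 states that RVQ achieves this upper bound, which
therefore bounds the asymptotic rate difference corresponding
to any quantization scheme.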

C. Proof of Theorem 3

We first prove the theorem for $\bar{N}_r \ge 1$. Let $z = F^{-1}_{\gamma|H}(1 - 2^{-B})$.
Rearranging (15) gives

$$\lim_{(N_t, N_r, B) \to \infty} \left[ 1 - F_{\gamma|H}(z) \right]^{\frac{1}{N_t}} = 2^{-\bar{B}}. \tag{94}$$

Next, we derive upper and lower bounds for the left-hand side
of (94) and show that they are the same. The derivation of the
upper bound is motivated by the evaluation of a similar bound
for CDMA signature optimization in [34]. That is,

$$1 - F_{\gamma|H}(z) = \Pr\{\gamma_j > z \mid H\} \tag{95}$$

$$= \Pr\{ v_j^\dagger U \Lambda U^\dagger v_j > z \mid \Lambda, U \} \tag{96}$$

$$= \Pr\left\{ \frac{w_j^\dagger U \Lambda U^\dagger w_j}{w_j^\dagger w_j} > z \,\middle|\, \Lambda, U \right\} \tag{97}$$

where $\gamma_j = \frac{1}{N_t} v_j^\dagger H^\dagger H v_j$ and we have applied the singular
value decomposition $\frac{1}{N_t} H^\dagger H = U \Lambda U^\dagger$, where $U$ is an
$N_t \times N_t$ unitary matrix, $\Lambda = \mathrm{diag}\{\lambda_1, \ldots, \lambda_{N_t}\}$, and the
eigenvalues are ordered as $\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_{N_t}$. Also, $w_j$
is an $N_t \times 1$ vector with independent, circularly symmetric,
zero-mean and unit-variance Gaussian elements. Both $v_j$ and
$w_j / \|w_j\|$ are isotropically distributed, i.e., $U w_j / \|w_j\|$ and
$w_j / \|w_j\|$ have the same distribution, so that



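$$1 - F_{\gamma|H}(z) = 1 - F_{\gamma|\Lambda}(z) \tag{98}$$

$$= \Pr\left\{ \frac{\sum_{i=1}^{N_t} \lambda_i |w_i|^2}{\sum_{i=1}^{N_t} |w_i|^2} > z \right\} \tag{99}$$

$$= \Pr\left\{ -\sum_{i=1}^{N_t} (z - \lambda_i) |w_i|^2 > 0 \right\} \tag{100}$$

$$= \Pr\left\{ -\rho \sum_{i=1}^{N_t} (z - \lambda_i) |w_i|^2 > 0 \right\} \quad \forall \rho > 0 \tag{101}$$

$$= \Pr\left\{ \exp\left\{ -\rho \sum_{i=1}^{N_t} (z - \lambda_i) |w_i|^2 \right\} > 1 \right\} \tag{102}$$

where $\{w_i\}$ are elements of $w_j$. (We omit the index $j$ to
simplify the notation.) Applying Markov's inequality and the
independence of the $w_i$'s gives

$$1 - F_{\gamma|\Lambda}(z) \le E\left[ \exp\left\{ -\rho \sum_{i=1}^{N_t} (z - \lambda_i) |w_i|^2 \right\} \,\middle|\, \Lambda \right] \tag{103}$$

$$= \prod_{i=1}^{N_t} E_{w_i}\left[ \exp\left\{ -\rho (z - \lambda_i) |w_i|^2 \right\} \mid \lambda_i \right] \tag{104}$$

$$= \prod_{i=1}^{N_t} \int_0^\infty \exp\{ -\rho (z - \lambda_i) x \} \, e^{-x} \, dx \tag{105}$$

$$= \prod_{i=1}^{N_t} \int_0^\infty \exp\{ -(1 + \rho (z - \lambda_i)) x \} \, dx \tag{106}$$

$$= \prod_{i=1}^{N_t} \frac{1}{1 + \rho (z - \lambda_i)} \tag{107}$$

$$= \exp\left\{ -\sum_{i=1}^{N_t} \log(1 + \rho (z - \lambda_i)) \right\} \tag{108}$$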

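when $1 + \rho(z - \lambda_i) > 0$ for all $i$, or $\rho < 1/(\lambda_1 - z)$. Taking
the large system limit, we obtain

$$\lim_{\substack{(N_t, N_r) \to \infty \\ z \to \gamma^\infty_{\mathrm{rvq}}}} \left[ 1 - F_{\gamma|\Lambda}(z) \right]^{\frac{1}{N_t}} \le \exp\{ -\Phi(\gamma^\infty_{\mathrm{rvq}}, \rho) \} \tag{109}$$

for $0 < \rho < \frac{1}{\lambda^\infty_{\max} - \gamma^\infty_{\mathrm{rvq}}}$, where

$$\Phi(\gamma^\infty_{\mathrm{rvq}}, \rho) = \int_a^b \log(1 + \rho(\gamma^\infty_{\mathrm{rvq}} - \lambda)) \, g(\lambda) \, d\lambda, \tag{110}$$

$g(\lambda)$ is given by (28)-(29), and $\lambda^\infty_{\max} = \lim_{(N_t, N_r) \to \infty} \lambda_1 =
(1 + \sqrt{\bar{N}_r})^2$. To tighten the upper bound, we minimize (109)
with respect to $\rho$, i.e.,

$$\lim_{\substack{(N_t, N_r) \to \infty \\ z \to \gamma^\infty_{\mathrm{rvq}}}} \left[ 1 - F_{\gamma|\Lambda}(z) \right]^{\frac{1}{N_t}} \le \exp\{ -\Phi(\gamma^\infty_{\mathrm{rvq}}, \rho^*) \} \tag{111}$$

where

$$\rho^* = \arg\max_{0 < \rho < \frac{1}{\lambda^\infty_{\max} - \gamma^\infty_{\mathrm{rvq}}}} \Phi(\gamma^\infty_{\mathrm{rvq}}, \rho). \tag{112}$$

A similar expression for RVQ performance when used to
quantize signatures for CDMA is derived in [34].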
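The finite-size bound (103)-(108) can be illustrated numerically. The sketch below estimates the tail probability in (99) by Monte Carlo for a fixed set of eigenvalues and compares it with the bound (108), tightened by a grid search over the admissible range of $\rho$. The eigenvalues and threshold in the usage lines are arbitrary illustrative values, and the grid search is a stand-in for the analytical optimization in (112).

```python
import math
import random


def chernoff_exponent(eigs, z, rho):
    """Sum of log(1 + rho * (z - lambda_i)), the exponent in (108)."""
    return sum(math.log(1.0 + rho * (z - lam)) for lam in eigs)


def chernoff_bound(eigs, z, grid=2000):
    """Smallest value of (108) over a grid of 0 < rho < 1 / (lambda_1 - z)."""
    lam_max = max(eigs)
    if lam_max <= z:
        return 0.0  # the weighted average in (99) can never exceed z
    rho_max = 1.0 / (lam_max - z)
    best = 0.0  # rho -> 0 gives exponent 0, i.e. the trivial bound 1
    for k in range(1, grid):
        best = max(best, chernoff_exponent(eigs, z, rho_max * k / grid))
    return math.exp(-best)


def tail_probability(eigs, z, trials=5000, seed=0):
    """Monte Carlo estimate of the probability in (99)."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        # |w_i|^2 is exponential with unit mean for unit-variance complex Gaussian w_i.
        e = [rng.expovariate(1.0) for _ in eigs]
        if sum(l * x for l, x in zip(eigs, e)) > z * sum(e):
            hits += 1
    return hits / trials


if __name__ == "__main__":
    eigs = [2.5, 1.8, 1.2, 0.7, 0.4, 0.2]  # illustrative eigenvalues only
    z = 1.6
    print(tail_probability(eigs, z), chernoff_bound(eigs, z))
```

The bound is loose at this small size. The proof only needs the exponential rate, which is why the $N_t$th root is taken in (109).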

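To derive the lower bound, we use a change of measure.
(A similar approach was used in [45, Section 1.2].) Let $y_i =
(\lambda_i - z) |w_i|^2$, which is a scaled exponential random variable
with cdf $F_i(\cdot)$. We define the new distribution

$$G_i(x) = \frac{1}{M_i(\rho^*)} \int_{-\infty}^{x} e^{\rho^* y} \, dF_i(y), \tag{113}$$

so that

$$M_i(\rho^*) \, dG_i(x) = e^{\rho^* x} \, dF_i(x), \tag{114}$$

where $\rho^*$ is given in (112), and the moment generating
function for $y_i$ is

$$M_i(\theta) \triangleq E[e^{\theta y_i}] \tag{115}$$

$$= \int_0^\infty e^{\theta (\lambda_i - z) x} e^{-x} \, dx \tag{116}$$

$$= \frac{1}{1 + \theta (z - \lambda_i)}. \tag{117}$$

Applying the change of measure (114), we have

$$1 - F_{\gamma|\Lambda}(z) = \Pr\left\{ \sum_{i=1}^{N_t} y_i > 0 \right\} \tag{118}$$

$$= \int \cdots \int 1\left[ \sum_{i=1}^{N_t} y_i > 0 \right] dF_1(y_1) \cdots dF_{N_t}(y_{N_t}) \tag{119}$$

$$= \int \cdots \int 1\left[ \sum_{i=1}^{N_t} y_i > 0 \right] e^{-\rho^* \sum_{i=1}^{N_t} y_i} \prod_{i=1}^{N_t} M_i(\rho^*) \, dG_i(y_i) \tag{120}$$

$$= \prod_{i=1}^{N_t} M_i(\rho^*) \int \cdots \int 1\left[ \sum_{i=1}^{N_t} y_i > 0 \right] e^{-\rho^* \sum_{i=1}^{N_t} y_i} \, dG_1(y_1) \cdots dG_{N_t}(y_{N_t}) \tag{121}$$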

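where

$$1[x > 0] = \begin{cases} 1 : & x > 0 \\ 0 : & x \le 0. \end{cases} \tag{122}$$

For any $\epsilon > 0$,

$$1 - F_{\gamma|\Lambda}(z) \ge \prod_{i=1}^{N_t} M_i(\rho^*) \int \cdots \int 1\left[ \epsilon N_t > \sum_{i=1}^{N_t} y_i > 0 \right] e^{-\rho^* \sum_{i=1}^{N_t} y_i} \, dG_1(y_1) \cdots dG_{N_t}(y_{N_t}) \tag{123}$$

$$\ge \prod_{i=1}^{N_t} M_i(\rho^*) \, e^{-\rho^* \epsilon N_t} \int \cdots \int 1\left[ \epsilon N_t > \sum_{i=1}^{N_t} y_i > 0 \right] dG_1(y_1) \cdots dG_{N_t}(y_{N_t}) \tag{124}$$

$$= \prod_{i=1}^{N_t} M_i(\rho^*) \, e^{-\rho^* \epsilon N_t} \Pr\left\{ \epsilon N_t > \sum_{i=1}^{N_t} \tilde{y}_i > 0 \right\} \tag{125}$$

where the $\tilde{y}_i$'s are independent random variables with cdf
$G_i(\cdot)$, and the second inequality follows since $\rho^* > 0$. To
determine the probability on the right-hand side of (125), we
first compute the mean of $\tilde{y}_i$,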



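$$m_i = \int y \, dG_i(y) \tag{126}$$

$$= \frac{1}{M_i(\rho^*)} \int y \, e^{\rho^* y} \, dF_i(y) \tag{127}$$

$$= (1 + \rho^*(z - \lambda_i)) \int_0^\infty (\lambda_i - z) \, x \, e^{\rho^* (\lambda_i - z) x} e^{-x} \, dx \tag{128}$$

$$= \frac{\lambda_i - z}{1 + \rho^*(z - \lambda_i)}. \tag{129}$$

Therefore

$$\frac{1}{N_t} \sum_{i=1}^{N_t} m_i = \frac{1}{N_t} \sum_{i=1}^{N_t} \frac{\lambda_i - z}{1 + \rho^*(z - \lambda_i)} \tag{130}$$

and the asymptotic mean

$$\lim_{\substack{(N_t, N_r) \to \infty \\ z \to \gamma^\infty_{\mathrm{rvq}}}} \frac{1}{N_t} \sum_{i=1}^{N_t} m_i \tag{131}$$

$$= \int_a^b \frac{\lambda - \gamma^\infty_{\mathrm{rvq}}}{1 + \rho^*(\gamma^\infty_{\mathrm{rvq}} - \lambda)} \, g(\lambda) \, d\lambda < \infty. \tag{132}$$

Similarly, since $\tilde{y}_i$ is exponentially distributed, the variance of
$\tilde{y}_i$ is

$$\sigma_i^2 = \left( \frac{\lambda_i - z}{1 + \rho^*(z - \lambda_i)} \right)^2 \tag{133}$$

and the asymptotic variance

$$\sigma_\infty^2 = \lim_{\substack{(N_t, N_r) \to \infty \\ z \to \gamma^\infty_{\mathrm{rvq}}}} \frac{1}{N_t} \sum_{i=1}^{N_t} \sigma_i^2 \tag{134}$$

$$= \int_a^b \left( \frac{\lambda - \gamma^\infty_{\mathrm{rvq}}}{1 + \rho^*(\gamma^\infty_{\mathrm{rvq}} - \lambda)} \right)^2 g(\lambda) \, d\lambda < \infty. \tag{135}$$

Both the asymptotic mean and variance are finite.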

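Since the $\tilde{y}_i$'s are independent with finite mean and variance,
the central limit theorem implies that the cdf for

$$T = \frac{\sum_{i=1}^{N_t} \tilde{y}_i - \sum_{i=1}^{N_t} m_i}{\sqrt{\sum_{i=1}^{N_t} \sigma_i^2}} \tag{136}$$

converges to a Gaussian cdf with zero mean and unit variance.
Therefore we have

$$\Pr\left\{ 0 < \sum_{i=1}^{N_t} \tilde{y}_i < \epsilon N_t \right\} = \Pr\left\{ -\sqrt{N_t} \, \alpha_{N_t} < T < -\sqrt{N_t} \, \alpha_{N_t} + \sqrt{N_t} \, \frac{\epsilon}{\sigma_{N_t}} \right\} \tag{137}$$

$$= F_T\left( -\sqrt{N_t} \, \alpha_{N_t} + \sqrt{N_t} \, \frac{\epsilon}{\sigma_{N_t}} \right) - F_T\left( -\sqrt{N_t} \, \alpha_{N_t} \right) \tag{138}$$

where $F_T(\cdot)$ is the cdf for $T$ and

$$\alpha_{N_t} = \frac{\frac{1}{N_t} \sum_{i=1}^{N_t} m_i}{\sigma_{N_t}}, \tag{139}$$

$$\sigma_{N_t} = \sqrt{ \frac{1}{N_t} \sum_{i=1}^{N_t} \sigma_i^2 }. \tag{140}$$

Let $\phi(\cdot)$ denote a Gaussian cdf with zero mean and unit
variance. We can rewrite (138) as

$$\Pr\left\{ 0 < \sum_{i=1}^{N_t} \tilde{y}_i < \epsilon N_t \right\} = \phi\left( -\sqrt{N_t} \, \alpha_{N_t} + \sqrt{N_t} \, \frac{\epsilon}{\sigma_{N_t}} \right) - \phi\left( -\sqrt{N_t} \, \alpha_{N_t} \right) + \zeta_{N_t} - \xi_{N_t} \tag{141}$$

where

$$\zeta_{N_t} = F_T\left( -\sqrt{N_t} \, \alpha_{N_t} + \sqrt{N_t} \, \frac{\epsilon}{\sigma_{N_t}} \right) - \phi\left( -\sqrt{N_t} \, \alpha_{N_t} + \sqrt{N_t} \, \frac{\epsilon}{\sigma_{N_t}} \right), \tag{142}$$

$$\xi_{N_t} = F_T\left( -\sqrt{N_t} \, \alpha_{N_t} \right) - \phi\left( -\sqrt{N_t} \, \alpha_{N_t} \right). \tag{143}$$

Applying the Berry-Esseen theorem [46], we can bound
both $\zeta_{N_t}$ and $\xi_{N_t}$ for large $N_t$ as

$$|\zeta_{N_t}|, \, |\xi_{N_t}| \le \frac{C}{\sqrt{N_t}} \tag{144}$$

where $C$ is a positive constant that depends on the variance
and third moment of $\tilde{y}_i$. Similar to the mean and variance, we
can show that the third moment is also finite.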
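We can now evaluate

$$\phi\left( -\sqrt{N_t} \, \alpha_{N_t} + \sqrt{N_t} \, \frac{\epsilon}{\sigma_{N_t}} \right) - \phi\left( -\sqrt{N_t} \, \alpha_{N_t} \right) = \frac{1}{\sqrt{2\pi}} \int_{-\sqrt{N_t} \alpha_{N_t}}^{-\sqrt{N_t} \alpha_{N_t} + \sqrt{N_t} \epsilon / \sigma_{N_t}} e^{-t^2/2} \, dt \tag{145}$$

$$\le \frac{1}{\sqrt{2\pi}} \, e^{-N_t (\alpha_{N_t} - \epsilon/\sigma_{N_t})^2 / 2} \, \frac{\sqrt{N_t} \, \epsilon}{\sigma_{N_t}}. \tag{146}$$

Substituting (144) and (146) into (141), we have

$$\Pr\left\{ 0 < \sum_{i=1}^{N_t} \tilde{y}_i < \epsilon N_t \right\} = O(1/\sqrt{N_t}). \tag{147}$$

Taking the large system limit and applying L'Hôpital's rule,
it follows that

$$\lim_{(N_t, N_r) \to \infty} \left[ \Pr\left\{ 0 < \sum_{i=1}^{N_t} \tilde{y}_i < \epsilon N_t \right\} \right]^{\frac{1}{N_t}} = 1. \tag{148}$$

Taking the $N_t$th root and large system limit on both sides of
(125) gives

$$\lim_{\substack{(N_t, N_r) \to \infty \\ z \to \gamma^\infty_{\mathrm{rvq}}}} \left[ 1 - F_{\gamma|\Lambda}(z) \right]^{\frac{1}{N_t}} \ge \exp\{ -\Phi(\gamma^\infty_{\mathrm{rvq}}, \rho^*) \} \tag{149}$$

where we use (148) and let $\epsilon \to 0$.
The lower bound in (149) is exactly the upper bound (111).
Therefore,

$$\lim_{\substack{(N_t, N_r) \to \infty \\ z \to \gamma^\infty_{\mathrm{rvq}}}} \left[ 1 - F_{\gamma|\Lambda}(z) \right]^{\frac{1}{N_t}} = \exp\{ -\Phi(\gamma^\infty_{\mathrm{rvq}}, \rho^*) \} \tag{150}$$

$$= 2^{-\bar{B}} \tag{151}$$

and the asymptotic RVQ received power satisfies the fixed-point
equation

$$\Phi(\gamma^\infty_{\mathrm{rvq}}, \rho^*) = \bar{B} \log(2) \tag{152}$$

where $\rho^*$ is given by (112). The goal of the rest of the proof
is to simplify (152).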

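To determine $\rho^*$, we first compute

$$\frac{\partial \Phi(\gamma^\infty_{\mathrm{rvq}}, \rho)}{\partial \rho} = \int_a^b \frac{\gamma^\infty_{\mathrm{rvq}} - \lambda}{1 + (\gamma^\infty_{\mathrm{rvq}} - \lambda) \rho} \, g(\lambda) \, d\lambda \tag{153}$$

$$= \int_a^b \left[ \frac{1}{\rho} + \frac{1}{\rho^2} \, \frac{1}{\lambda - y} \right] g(\lambda) \, d\lambda \tag{154}$$

$$= \frac{1}{\rho} + \frac{1}{\rho^2} S_\Lambda(y) \tag{155}$$

where $y = \gamma^\infty_{\mathrm{rvq}} + 1/\rho$ and $S_\Lambda(\cdot)$ is the Stieltjes Transform of the asymptotic
eigenvalue distribution of $\Lambda$. Setting the derivative to zero and
solving for $\rho$ gives

$$S_\Lambda(y) = -\rho = \frac{1}{\gamma^\infty_{\mathrm{rvq}} - y} \tag{156}$$

as the only valid solution. Substituting the expression for $S_\Lambda$
given in [33] into (156) gives

$$\frac{(-1 + \bar{N}_r - y) \pm \sqrt{y^2 - 2(\bar{N}_r + 1) y + (\bar{N}_r - 1)^2}}{2y} = \frac{1}{\gamma^\infty_{\mathrm{rvq}} - y} \tag{157}$$

which simplifies to the quadratic equation

$$(\bar{N}_r - \gamma^\infty_{\mathrm{rvq}}) \, y^2 + \left[ \gamma^\infty_{\mathrm{rvq}} - \bar{N}_r \gamma^\infty_{\mathrm{rvq}} + (\gamma^\infty_{\mathrm{rvq}})^2 \right] y = 0. \tag{158}$$

Solving for $y$ gives $y = 0$ or $y = \gamma^\infty_{\mathrm{rvq}} [1 + 1/(\gamma^\infty_{\mathrm{rvq}} - \bar{N}_r)]$,
or equivalently, $\rho = -1/\gamma^\infty_{\mathrm{rvq}}$ or $\rho = (\gamma^\infty_{\mathrm{rvq}} - \bar{N}_r)/\gamma^\infty_{\mathrm{rvq}}$. Since
$\rho > 0$, we must have

$$\rho^* = \frac{\gamma^\infty_{\mathrm{rvq}} - \bar{N}_r}{\gamma^\infty_{\mathrm{rvq}}}. \tag{159}$$

Since

$$\frac{\partial^2 \Phi(\gamma^\infty_{\mathrm{rvq}}, \rho)}{\partial \rho^2} \bigg|_{\rho = \rho^*} = -\int_a^b \left( \frac{\lambda - \gamma^\infty_{\mathrm{rvq}}}{1 + \rho^*(\gamma^\infty_{\mathrm{rvq}} - \lambda)} \right)^2 g(\lambda) \, d\lambda \tag{160}$$

$$< 0, \tag{161}$$

$\rho^*$ achieves a maximum.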

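By also evaluating $\Phi(\gamma^\infty_{\mathrm{rvq}}, \rho)$ at the boundary points $\rho = 0$
and $\rho = 1/(\lambda^\infty_{\max} - \gamma^\infty_{\mathrm{rvq}})$, we have

$$\rho^* = \begin{cases} \dfrac{\gamma^\infty_{\mathrm{rvq}} - \bar{N}_r}{\gamma^\infty_{\mathrm{rvq}}}, & \bar{N}_r \le \gamma^\infty_{\mathrm{rvq}} \le \bar{N}_r + \sqrt{\bar{N}_r} \\[2ex] \dfrac{1}{(1 + \sqrt{\bar{N}_r})^2 - \gamma^\infty_{\mathrm{rvq}}}, & \bar{N}_r + \sqrt{\bar{N}_r} \le \gamma^\infty_{\mathrm{rvq}} \le (1 + \sqrt{\bar{N}_r})^2. \end{cases} \tag{162}$$

To evaluate $\Phi(\gamma^\infty_{\mathrm{rvq}}, \rho^*)$, we re-write (110) as

$$\Phi(\gamma^\infty_{\mathrm{rvq}}, \rho^*) = \int_a^b \left[ \log(\rho^*) + \log\left( \left( \frac{1}{\rho^*} + \gamma^\infty_{\mathrm{rvq}} \right) - \lambda \right) \right] g(\lambda) \, d\lambda \tag{163}$$

$$= \log(\rho^*) + \int_a^b \log\left[ \left( \frac{1}{\rho^*} + \gamma^\infty_{\mathrm{rvq}} \right) - \lambda \right] g(\lambda) \, d\lambda. \tag{164}$$

To evaluate the integral in (164), we apply the following
Lemma.

Lemma 1. For $x \ge (1 + \sqrt{\bar{N}_r})^2$,

$$\Theta(x) \triangleq \int_a^b \log(x - \lambda) \, g(\lambda) \, d\lambda \tag{165}$$

$$= \log(w(x)) + \sqrt{\bar{N}_r} \, u(x) - (\bar{N}_r - 1) \log\left( 1 + \frac{u(x)}{\sqrt{\bar{N}_r}} \right) \tag{166}$$

where

$$w(x) = \frac{(x - 1 - \bar{N}_r) + \sqrt{(x - 1 - \bar{N}_r)^2 - 4 \bar{N}_r}}{2}, \tag{167}$$

$$u(x) = \frac{(x - 1 - \bar{N}_r) - \sqrt{(x - 1 - \bar{N}_r)^2 - 4 \bar{N}_r}}{2 \sqrt{\bar{N}_r}}. \tag{168}$$

The proof of this Lemma is similar to that given in [36]
and is therefore omitted here.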

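For $\bar{N}_r + \sqrt{\bar{N}_r} \le \gamma^\infty_{\mathrm{rvq}} \le (1 + \sqrt{\bar{N}_r})^2$, we substitute $\rho^* =
[(1 + \sqrt{\bar{N}_r})^2 - \gamma^\infty_{\mathrm{rvq}}]^{-1}$ into (164) to obtain

$$\Phi\left( \gamma^\infty_{\mathrm{rvq}}, [(1 + \sqrt{\bar{N}_r})^2 - \gamma^\infty_{\mathrm{rvq}}]^{-1} \right) \tag{169}$$

$$= -\log[(1 + \sqrt{\bar{N}_r})^2 - \gamma^\infty_{\mathrm{rvq}}] + \Theta\left( (1 + \sqrt{\bar{N}_r})^2 \right) \tag{170}$$

$$= -\log[(1 + \sqrt{\bar{N}_r})^2 - \gamma^\infty_{\mathrm{rvq}}] + \frac{1}{2} \log(\bar{N}_r) - (\bar{N}_r - 1) \log\left( 1 + \frac{1}{\sqrt{\bar{N}_r}} \right) + \sqrt{\bar{N}_r} \tag{171}$$

$$= \bar{B} \log(2). \tag{172}$$

Solving for $\gamma^\infty_{\mathrm{rvq}}$ gives (18). Taking $\gamma^\infty_{\mathrm{rvq}} = \bar{N}_r + \sqrt{\bar{N}_r}$ and
solving for $\bar{B}$ gives $\bar{B}^*$ in (19).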

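For $\bar{N}_r \le \gamma^\infty_{\mathrm{rvq}} \le \bar{N}_r + \sqrt{\bar{N}_r}$, or $0 \le \bar{B} \le \bar{B}^*$, we substitute
$\rho^* = \frac{\gamma^\infty_{\mathrm{rvq}} - \bar{N}_r}{\gamma^\infty_{\mathrm{rvq}}}$ into (164) to obtain

$$\Phi\left( \gamma^\infty_{\mathrm{rvq}}, \frac{\gamma^\infty_{\mathrm{rvq}} - \bar{N}_r}{\gamma^\infty_{\mathrm{rvq}}} \right) = \log(\gamma^\infty_{\mathrm{rvq}} - \bar{N}_r) - \log(\gamma^\infty_{\mathrm{rvq}}) + \Theta\left( \gamma^\infty_{\mathrm{rvq}} + \frac{\gamma^\infty_{\mathrm{rvq}}}{\gamma^\infty_{\mathrm{rvq}} - \bar{N}_r} \right). \tag{173}$$

To simplify (173), we let $\psi = \gamma^\infty_{\mathrm{rvq}} - \bar{N}_r$ and re-write (173) as

$$\log(\psi) - \log(\psi + \bar{N}_r) + \Theta\left( 1 + \bar{N}_r + \psi + \frac{\bar{N}_r}{\psi} \right). \tag{174}$$

After some manipulation we have

$$w\left( 1 + \bar{N}_r + \psi + \frac{\bar{N}_r}{\psi} \right) = \frac{\bar{N}_r}{\psi}, \tag{175}$$

$$u\left( 1 + \bar{N}_r + \psi + \frac{\bar{N}_r}{\psi} \right) = \frac{\psi}{\sqrt{\bar{N}_r}} \tag{176}$$

and

$$\Theta\left( 1 + \bar{N}_r + \psi + \frac{\bar{N}_r}{\psi} \right) = \log(\bar{N}_r) - \log(\psi) - (\bar{N}_r - 1) \log\left( 1 + \frac{\psi}{\bar{N}_r} \right) + \psi. \tag{177}$$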
Substituting (177) into (174), we obtain

$$\Phi\left( \psi + \bar{N}_r, \frac{\psi}{\psi + \bar{N}_r} \right) = \psi - \bar{N}_r \log\left( 1 + \frac{\psi}{\bar{N}_r} \right) \tag{178}$$

$$= \gamma^\infty_{\mathrm{rvq}} - \bar{N}_r - \bar{N}_r \log(\gamma^\infty_{\mathrm{rvq}}) + \bar{N}_r \log(\bar{N}_r). \tag{179}$$

Setting this to $\bar{B} \log(2)$ and simplifying gives (17).
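The two branches of the fixed-point equation can be solved numerically as a check on the derivation. The sketch below assumes $\bar{N}_r \ge 1$, uses the expression in (179) for small feedback and (171)-(172) for large feedback, and finds the low-feedback solution by bisection. The function names are hypothetical; the closed forms (17)-(19) in the theorem are not reproduced here.

```python
import math


def phi_low(gamma, nr):
    """Right-hand side of (179), valid for nr <= gamma <= nr + sqrt(nr)."""
    return gamma - nr - nr * math.log(gamma) + nr * math.log(nr)


def b_star(nr):
    """Normalized feedback at which gamma reaches nr + sqrt(nr)."""
    return phi_low(nr + math.sqrt(nr), nr) / math.log(2.0)


def gamma_rvq(b_bar, nr, tol=1e-12):
    """Asymptotic RVQ received power solving Phi = b_bar * log 2, for nr >= 1."""
    target = b_bar * math.log(2.0)
    s = math.sqrt(nr)
    if b_bar <= b_star(nr):
        # phi_low is increasing in gamma on [nr, nr + sqrt(nr)], so bisect.
        lo, hi = nr, nr + s
        while hi - lo > tol:
            mid = 0.5 * (lo + hi)
            if phi_low(mid, nr) < target:
                lo = mid
            else:
                hi = mid
        return 0.5 * (lo + hi)
    # High-feedback branch: invert (171)-(172) in closed form.
    theta_edge = 0.5 * math.log(nr) + s - (nr - 1.0) * math.log(1.0 + 1.0 / s)
    return (1.0 + s) ** 2 - math.exp(theta_edge - target)


if __name__ == "__main__":
    for b_bar in (0.0, 0.2, b_star(1.0), 1.0, 3.0):
        print(round(b_bar, 4), gamma_rvq(b_bar, 1.0))
```

With zero feedback the solution equals $\bar{N}_r$, and as $\bar{B}$ grows it approaches the largest asymptotic eigenvalue $(1 + \sqrt{\bar{N}_r})^2$, consistent with the range of $\gamma^\infty_{\mathrm{rvq}}$ in (162).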

For $\bar{N}_r < 1$, the asymptotic eigenvalue density of $\frac{1}{N_t} H^\dagger H$
is given by

$$g(\lambda) = (1 - \bar{N}_r) \, \delta(\lambda) + \frac{\sqrt{(\lambda - a)(b - \lambda)}}{2 \pi \lambda} \tag{180}$$

where $a$ and $b$ are given by (28)-(29). Following the same
steps again from (110) gives (18) and (17). This completes
the proof of Theorem 3.

D. Derivation of (41)-(43)

To compute $\mu_J$, we first write

$$J_j^{N_r} = \frac{1}{N_r} \sum_{k=1}^{N_r} \log(1 + \rho \nu_k) \tag{181}$$

where $\nu_k$ is the $k$th eigenvalue of $\Upsilon = \frac{1}{K} H V_j V_j^\dagger H^\dagger$.
As $(N_t, N_r, K) \to \infty$, the empirical eigenvalue distribution
converges to a deterministic function $F_\Upsilon(t)$. The asymptotic
mean is given by

$$\mu_J = \lim_{(N_t, N_r, K) \to \infty} E[J_j^{N_r}] = \int \log(1 + \rho t) \, dF_\Upsilon(t). \tag{182}$$

A similar integral has been evaluated in [36, Eq. (6)], and the
result can be directly applied to (182), giving (41).

To compute the variance, we express $J_j^{N_r}$ differently by
first performing the singular value decomposition $H =
V_H \Sigma_H U_H^\dagger$, where $V_H$ is the $N_r \times N_r$ left singular matrix,
$U_H$ is the $N_t \times N_r$ right singular matrix, and $\Sigma_H$ is an
$N_r \times N_r$ diagonal matrix. Here we assume that $N_t \ge N_r$.
(The result for $N_t < N_r$ can be shown by a similar approach.)
We therefore have

$$J_j^{N_r} = \frac{1}{N_r} \log\det\left( I_{N_r} + \rho \Lambda L_j \right) \tag{183}$$

$$= \frac{1}{N_r} \sum_{i=1}^{N_r} \log(1 + \rho \eta_i) \tag{184}$$

where $\Lambda = \frac{1}{K} \Sigma_H^2$, $L_j = U_H^\dagger V_j V_j^\dagger U_H$, and $\eta_i$ is the $i$th
eigenvalue of $\Lambda L_j$. To compute $\mathrm{var}[J_j^{N_r}]$, correlations between
pairs of $\eta_i$'s are needed. Although the joint distribution
of eigenvalues is known, it is complicated, so that computing
the variance appears intractable.



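To approximate the variance of $J_j^{N_r}$, we substitute a Taylor
series expansion for $\log(1 + x)$ into (184) to write

$$J_j^{N_r} = \frac{\rho}{N_r} \sum_{i=1}^{N_r} \eta_i - \frac{\rho^2}{2 N_r} \sum_{i=1}^{N_r} \eta_i^2 + \frac{\rho^3}{3 N_r} \sum_{i=1}^{N_r} \eta_i^3 - \cdots \tag{185}$$

$$= \frac{\rho}{N_r} \mathrm{tr}\{\Lambda L\} - \frac{\rho^2}{2 N_r} \mathrm{tr}\{(\Lambda L)^2\} + \frac{\rho^3}{3 N_r} \mathrm{tr}\{(\Lambda L)^3\} - \cdots \tag{186}$$

for $\rho \eta_{\max} < 1$, where $\eta_{\max} = \max_i \eta_i$ is the maximum
eigenvalue of $H V V^\dagger H^\dagger / K$. Since $HV$ is $N_r \times K$ and i.i.d.,
$\eta_{\max}$ has asymptotic value $(1 + \sqrt{N_r/K})^2$. If $K/N_r = 1$,
then the condition asymptotically becomes $\rho < 1/4$ ($-6$ dB).
Ignoring the terms of order $\rho^3$ and higher, we can approximate
the variance of $J_j^{N_r}$ at low SNR as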



var[jj Vr ] sw p 2 var 



N^ {AL} 



(187) 



Letting $\Lambda = \mathrm{diag}\{\lambda_i\}$ and $l_{ij}$ denote the $(i, j)$th element of
$L$, the first term in (187) can be expanded as

var[tr{AZ}|A] = £ A 2 (E[%] - E 2 [l u }) 

i=i (188) 

For a given $U_H$ and random unitary $V$ with $K = N_r$, Theorem 3
in [7] states that $L$ has a multivariate beta distribution
with parameters $N_r$ and $N_t - N_r$. (The distribution of $L$ is
not known for general $K$.) From Theorem 2 in [47], we have

A. + l 



E 



Ni 



? (A r + l)(A r + 3) 



(189) 



(190) 



(N t + 2)(N t +A) 

N r (N r + l)(JV t + 4) + (JV r + l)(JV t - JV r + 1) 
(JV t + l)(JV t + 2)(JV t + 4) 
i^J, (191) 
for 1 < i,j < N r . Substituting dT89l>-JT9TT> into (fT88l gives 



var[tr{AL}|A] 




1 



{N r - l)N 3 + O 



N r 



(192) 



Taking expectation with respect to $\Lambda$, and the large system
limit, we have

$$E_\Lambda\left( \mathrm{var}[\mathrm{tr}\{\Lambda L\} \mid \Lambda] \right) \to 1 - \bar{N}_r.$$

Also, in the large system limit

4— 1 X 



1 

A^ 



- [/ 



1 2 



tdF A (t) 



1 

JVf 



(193)

(194)
(195)
where $F_\Lambda(t)$ is the asymptotic distribution for the diagonal
elements of $\Lambda$ or, equivalently, the asymptotic eigenvalue
distribution of $H H^\dagger / N_r$.

Substituting (193) into (187), we have

$$\sigma_J^2 = \lim N_r^2 \, \mathrm{var}[J_j^{N_r}] \tag{196}$$

$$\approx \rho^2 (1 - \bar{N}_r). \tag{197}$$